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COMMENTS ON THE APPROXIMATE SOLUTION 
OF THE DIFFUSION EQUATION: II 


Joun Z. HEARON* 


DEPARTMENT OF PHYSIOLOGY 
THE UNIVERSITY OF CHICAGO 


On the basis of a previous general formulation (Bull. Math. Biophysics, 15, 21-29, 1953a) 
a discussion is given of the error in the approximation method of N. Rashevsky. This error, 
inherent in the method when the metabolic rate is different at each point in the cell, is dis- 
cussed in detail and numerical values are presented for two particular cases: the rate propor- 
tional to the concentration and the rate a prescribed function of the spatial coordinates. It is 
shown that the formulation for the first case also applies to several other cases, that the error 
is negligible provided the rate is sufficiently small, and that the error is fairly sensitive to the 
cell size. If the rate depends upon the coordinates alone a small rate is not sufficient to insure 
a negligible error. The relations between the exact method, the standard approximate method, 
an earlier approximate method (Physics, 7, 260, 1936), and a more recent refinement (Bull. 
Math. Biophysics, 10, 201, 1948) of the standard method are discussed. 


Introduction. In a previous paper (Hearon, 1952b) it has been shown 
that the solution of the diffusion equation, V?C = —Q/D,, for the steady 
state, spherical cell is the solution of the integral equation 


r r, G(r) G(r) 


oz = 
apie + 3D, et D;7 Dzr ; 


C (4) .C° 4+ (1) 


Here Q, the rate of production per unit volume, is in general a function 
of C and 7, and (1) is valid under quite broad conditions on Q. The in- 
ternal diffusion coefficient, the external diffusion coefficient, the permea- 
ability, and the radius of the cell (all assumed constant) are denoted by 
D,, D., h, and ro respectively; C° is the external concentration at infinity 
and 


G(r) = f (r= ») 00) dr. (2) 
0 
In (1), and in what follows, bars denote averages over the volume of the 


cell, i.e., for any ee 
= TT ia 
iF; F — 3 f wey . 


* This work was performed while the author was a research participant, Oak Ridge Institute 
of Nuclear Studies, assigned to the Mathematics Panel, Oak Ridge National Laboratory. 
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Here Q, the rate of production per unit volume, is in general a function 
of C and r, and (1) is valid under quite broad conditions on Q. The in- 
ternal diffusion coefficient, the external diffusion coefficient, the permea- 
ability, and the radius of the cell (all assumed constant) are denoted by 
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From (1), C is obtained as 

C=C°+ (A+e) (3) 
provided that 0 ¥ 0, where A is 


2 
a0 To Yo 4 
A=3,+3p,* 15D, (4) 
and ¢ is 
net 
= To 1— 720 5 
<= i007, | 70 |. (9) 


If this same problem is solved by the approximation method of Rashevsky 
(1948a) the result, except for the numerical factors in A, is identical with 
(1) provided the quantity ¢ vanishes. Exclusive of the numerical factors in 
A, then, the approximation method is, in principle, in error except when 
Q is constant. 

It is the purpose of this paper to discuss in some detail the value of e€ 
for two particular functional forms of Q. The method here is of strictly 
limited utility, being based upon knowledge of the exact solution which is 
available in only a few cases. Of more service would be an endeavor to 
secure general, if approximate, statements regarding e based upon those 
broad features of the exact solution which can be deduced from very gen- 
eral properties of Q (Hearon, 1953b, references and discussion therein). 
However, in what follows certain generalizations emerge. It is also shown 
that Case I, below, applies exactly in three different physical situations 
and approximately in a fourth. Some comments on an earlier method, due 
to Rashevsky, and a later refinement (Rashevsky, 1948b) of the standard 
approximate method are given. 

Case I: Q = —kC(r);k > 0. For this case 


AC oy ae (6) 


where A is a constant, the value of which we will not need, and a? = k/D,. 
It is evident that °0/r0 = PC/rC and this ratio is given by 
FO JO rsinh ar dr (7) 
i ped et oh Eee oS ot 
yr? Q To é * 
gr th rsinhar dr 


Using standard integrals we obtain, after some rearrangement, 


+ 


_ = _ 2 tanhe a 
eestenh a) a?) 


(8) 
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where « = aro. From (5) and (8) 


7 r [eee ae 
1SD;lx—tanhx x 
(9) 
re 
=T5p, pene | 


It will be noted that the factor in (9) which multiplies £(”) is identical 
with the last terms of A, i.e., the term contributed by internal diffusion. 
We will discuss the function £(c) which is the correction term, e, expressed 
in multiples of the internal diffusion term of A. 

It can be shown that 


Lim E(x) = -1 (10) 
and, cf. (5) and (13), that 
Lim €(*) =0. (11) 


For small values of x, 720/720 may be computed from the series ratio 
p> (29+ 1)1(2 7-5) 
Se 


G7 150743) 


iatres 


Fe 


(12) 


no 


x4 


5x? 
eens as (13) 


fF taeat 


Table I gives the values of £(%) for some chosen values of x. ij 
Since £(”) has a constant sign, the approximate method gives values of C 
which are always too great. Large values of &(x) tend to minimize the 
contribution from internal diffusion. For x < .85, the value of &(«) may 
be regarded as negligible. If it is assumed that k ~ 10~* sec.-' (correspond- 
ing to a half-life of ~ 69 sec.), and D; ~ 10-7 cm.’ sec.", then for fo ~ 
3 X 10-3 cm. (Arbacia egg), « ~.95. Under these conditions £(x) 
amounts to about 2.5% of the internal diffusion term. For these same 
values of & and D; if ro is increased only three-fold, (i.e., ro ~ 10-*), ba 
3.15 and £(x) rises to about 20% of the internal diffusion term. With 
D;~ 10-7 and rm ~3 X 10-3, x ~ 10+/g. It is readily seen that, for 
k = 1, &(x) = 60% of the internal diffusion term. It will be recalled that 
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for a simple enzyme system Q = —VmnC/Km, for small C, where K,, and 
V,, are the Michaelis-Menten constant and limiting velocity (cf. Hearon, 
1949). Since k = Vn/Km it is possible to have very large (= 10) &, al- 
though it must be admitted that this will depend upon the concentration 
of enzyme which enters V,, as a factor. ; 
Clearly the importance of the correction ¢ will depend upon the im- 
portance of the term 73/15 D,; in A. Further the seriousness of the error 
in the approximation method will depend upon the particular application. 


TABLE I 
Values of &(x) = 5 tanh «/(x-tanh x) — (15 4+ 2*)/2x?; x = aro = VRro/ VD;- 
% — £(x) 
© 1.000 
10.00 0.5940 
5.00 0.3500 
4.00 0.2720 
Re) 0.1990 
3.00 0.1850 
2.00 0.1000 
1.50 0.0434 
1.00 0.0277 
0.95 0.0251 
0.85 0.0199 
0.60 0.0098 
0.45 0.0029 
0.30 0.0026 
0.10 0.0004 
0.00 0.0000 


For example the propagated error in C, from (3), may be quite different 


from that in Q: 
* — kC 
CT Tae” 


It is not difficult to perceive that for Q = —kC(r), a small value of & 
results in a concentration distribution such that C(r), and hence Q, does 
not vary greatly with r. The above discussion and Table I show that, for 
given 7o and D,, it is required that k be sufficiently small (x ~ 1.00) in 
order that the approximate and exact solution for C agree to the extent 
originally claimed for the approximate method. 

Case IT: Q(r) = Are~*. It has long been realized (Rashevsky, 1948b) 
that the standard approximation method fails if the concentration is not 
monotone in the two principal directions of the (assumed) spheroid cell. 
The error which we are discussing would be expected to be relatively large 
in such a case if Q depends upon C. It might also be expected that this 
error is more serious when (Q is not monotone in 0 < r S 7p even though 
it has been shown (Hearon, 1953a) that C(r) is monotone if Q does not 
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change sign in 0 < r < ro. We will discuss such a case on the basis of an 
assumed hypothetical form for Q. It will not be possible to discuss what 
may reasonably be expected as a value for € unless some physical meaning 
is assigned to the parameters in Q. If Q(r) has the form 


Ole) = pec. (14) 


where the units of A are mols. sec.-! cm.-‘, and those of py are lange hy 
will be necessary to assign values to v7» only. In this case 


rO arf oredr (15) 
a 120 — [x§+ 544+ 20x3+ 60224 120x+ 120] e-? 
3x? [6 — (x8 +4+3x%2+ 6x+6) e*] , 


where « = vro. Now the function Q(r) has a maximum at r = 1/y and 
we may require that this value of r be some multiple, 1/p, of ro. Then 
p=vro = x, and if p = 1, Q(r) is monotone in 0 Sr S 1, while if 
p <1, Q(r) has a maximum in 0 S r < 7p. Thus by suitable choice of x 
this case gives Q(r) monotone or not, and if not the maximum may be 
placed at any point in the cell. Again we deal with the function &(x) = 
3(1 — 70/r0)/2 so that ¢ is given by the second equation of (9). From 
(15) it is clear that 


Lim &(«) =3 


and it can be shown that as x 0, &(x) — —4. Table II gives the 
values of £(x) for some chosen values of x. 

It is seen that &(~) may be positive or negative and vanishes between 
4/3 and 2. Even though, for given A and 7», large v tends to make Q small, 
£(x) becomes fairly large and approaches 150% of the internal diffusion 
term. For « < 0.10, £(«) increases slowly but the values are quite difficult 
to compute even to one significant figure. If we require that 1/v = ro/p 
then for increasing « > 1, the maximum point of Q(r) moves toward the 
center of the cell. When the maximum is between 370/4 and 70/2, &(x) van- 
ishes. The behavior of £(x) does depend upon the fact that Q(0) = 0, 
for if we assume that O(r) = A(Q°/v + re~”) then 


PO _ Qoxd4+5 [120 — (x84 5et+ ... +120) eI 


16 
PO Q%xs-+ 3x2 [6 — (x8 +302+6x+6) e*l AN 


? 


» 
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from which it is clear that 
Lim E(x) =0, 


and it can be shown that £(x) remains positive and bounded as x — 0. 

Discussion. It is to be stressed again that the error discussed in this and 
the previous paper (Hearon, 1952b) is an error above and beyond, and 
generally more serious than, that due to the acknowledged, geometric 
approximations in the approximate method. 

It has been shown that if Q is proportional to C the error is negligible if 
Q is sufficiently small. As discussed, it is obvious that if Q is small, C(r) 
will not differ much from the external concentration and will not vary 
greatly from point to point in the cell. If Q depends upon C alone, then Q 
will likewise not vary greatly from point to point and from (5) it is plain 


TABLE II 
Values of &(x) = 3(1 — 7?0/r?O) from (15); « = vo. 
x (x) 
3/2 


0.9750 
0.4740 
0.1800 
0.0525 
—0.0180 
—0.0615 
—0.117 
—0.15 
—0.21 


—1/6 


8 


_ 


SD COCORrFNWUO 


Sss8essss 


that e will be small. This has been shown for Q = —AC and, while it 
appears likely that it is also true when Q is any function of C, this re- 
mains to be proved. It should be remembered that what is meant by “‘Q 
sufficiently small” will depend upon the value of 7» and that, at least in 
Case I, € is considerably more sensitive to ro than to k. As shown by 
Case IT, if Q = Q(r), a small value of Q is by no means sufficient to insure 
a small e. It does not seem likely that Q will depend upon r alone more 
often than upon C alone (or upon C and r) but this is a distinct possibility 
(cf. discussion Hearon, 1950, 1953a) and in such a case the error is likely 
to be appreciable. 

In any case the importance of the error will clearly depend upon the 
importance of the contribution of the internal diffusion term to A. Most 
often in practice it is permissible to set D, = ~, since the external 
medium is thoroughly stirred. In such cases A will consist of a permeabil- 
ity and an internal diffusion term. If the permeability term is dominant 
then the importance of e will be less than if the permeability and internal 
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diffusion terms are similar in magnitude (which is quite likely) or than if 
the internal diffusion term is dominant. 

The function &(«) of Case I applies to certain other, fairly different 
situations. For a first order reaction, %; — %2, it will be true that O1= 
—kC; = —Q2. Therefore (x) = &(«), where the £;(x) are from Case ie 
but, in general, €; ~ €, and A; ¥ A, and the importance of the two errors 
may be quite different. Similar considerations apply when there is more 
than one product. If the case 1 — a, — x3 is considered and if it is as- 
sumed that 2; disappears at a constant rate, g, and 2 at a first order rate, 
then 

Qi=—9G, Qs=q—kC:, Qs=kC2. (17) 
The exact solution for C(7), under the assumption D, = ~, has been 
given by A. T. Reid (1951). In the notation of this paper it is, if k ¥ 0, 


A’(q—kC2) sinh ar 
BC ra 


C2(r) =}- (18) 


where a is defined in Case J and A’ is the constant A of Case I evaluated 
for D, = ~. It is clear that e; = 0, and, since from (17) and (18) 


A’(q—kC3) sinh ar 
ee (19) 


that £(x) is (x) of Case I. From (17) and (18) it can be seen that if q is 
sufficiently small, &3(x) = &(x), where x is as in Case I, and that if k is 
sufficiently small £3(*) = 0. Otherwise 7°03/7?0; must be formulated 
from (17) and (18). It can be shown that this ratio is smaller than that of 
Case I and is in fact less than unity, thus: 0 < 7703/7703 S$ 1 S 7°Q:/rQ2. 
Therefore £; which is mot in general a function of x alone may vary from 
zero to 3/2. There is one further, special possibility: C2(r) has the property 
that if the external concentration C? has the value g/k then C2(7) has 
everywhere the value C}, and Q2 = 0. 

Mention should be made here of a method (Landahl, 1937; Rashevsky, 
1948a) which preceded the standard approximation method discussed 
here and in the previous paper (Hearon, 1953b). The basis for this 
method is the following argument: If for Case J, C(r) be expanded for 
small ar there results 


2 4 
C(r) = Aa [144 80 +...]. (20) 


If second order terms are retained in (20) and from this yC(r) is formulated 
the result is V?C(r) = Aa* = kAa/D;. But in general v7C(r) = —O/D:. 
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It is therefore asserted that approximating C(r) to second order terms is 
equivalent to replacing —Q, in the differential equation, by the constant, 
average value kAa. Here two points are clear. 

From (20) the value of C to second order terms is 


is (a ro) z 9: 1 
C= Aal 1+] (21) 
and (, to the same accuracy, is —& times (21). Secondly, the constant 
value, — Aa, is obtained for Q by the above procedure as a result of 
not retaining fourth order terms in (20) for the purposes of computing 
vC(r). Plainly, it cannot be maintained that C(r) = Aa[1 + (ar)?/3!] 


and at the same time that OQ = —&C(r) is constant. It is readily verified 
that if C(r) is to be estimated to second order terms then for consistency, 
since C(r) is a solution of V?C(r) = —a?C(r), fourth order terms must be 


retained in C(r) for the purpose of computing V?C(r). It is further obvious 
that if C(r) is approximated to zero order terms then C(r), and hence Q, 
is constant and there is nothing to discuss. The results of earlier work 
(Hearon, 1953b) and of this paper exhibit explicitly the relation between 
the exact method, the standard approximate method and the approxi- 
mate method discussed in the last two paragraphs. 

Finally, brief mention will be made of a refinement of the standard 
approximate method. This refinement (Rashevsky, 1948b) consists of 
dividing the cell into regions in each of which C is monotone. The rate of 
change of the amount of solute in each region is expressed in terms of 0 
for that region and the flow from those regions with which it is con- 
tiguous. Each region is then treated after the manner of the Householder 
tri-axial cell (1942). While this refinement has served certain formal pur- 
poses (Rashevsky, 1948b) it is certainly difficult to apply in the absence 
of the knowledge of the exact solution. If the regions are left unspecified 
as to exact size and location however, the method is straightforward (if 
tedious), represents the extension of the standard approximation method 
to the case where C is not monotone, and gives a gross distribution of 
the C; in the various regions. In this refinement of the standard approx1- 
mation method the mean concentration in the ith region, C,, involves the 
kind of error discussed here since each A; is subject to a correction of the 
type given by (5). It is reasonable to assume that the magnitude of this 
error is less for any sub-region than that for the cell as a whole (as reck- 
oned by the standard method) and that C = 2,C;V;/V, where V, is 
the volume of the ith region, agrees more closely with the exact value 
of C than that computed by the standard approximation method. 
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It is shown on the basis of (1) conservation of mass, (2) positive concentrations, and (3) 
the principle of detail balancing that periodic reactions cannot occur in a closed system de- 
scribed by /inear differential equations. The matrix, A, of the rate equations must be such that 
|A| = 0, ai; > Ofori #7, ai; < 0, and VAV— = B, where V is diagonal and B is symmetric. 
These properties of A imply that the latent roots are real and non-positive and that neither 
catalysis nor inhibition can be described by linear equations. It is further shown that periodic 
reactions cannot occur in an open system for which the matrix associated with the chemical 
reactions has the above properties and in which the simple law of diffusion is obeyed. The rela- 
tion of these results to Onsager’s reciprocal relations and to previous work on periodic and cyclic 
chemical reactions is discussed. The utility of certain of these results for the treatment of iso- 
tope kinetics is indicated. 


1. Introduction. The biological occurrence of periodic phenomena is 
well known. It is of interest to consider whether certain cases may be 
explained on the basis of periodic chemical reactions. It may be said that no 
case has yet been so explained, in a satisfactory manner, and that the 
formal problem of periodic chemical reaction is not highly developed. It is 
reasonable to assume that the development of the formal chemical 
kinetics of periodic chemical reactions must precede the solution of the 
associated diffusion problems. 

The possibility of periodic reactions was early considered (Lotka, 1910; 
Hirniak, 1911; Lotka, 1920) and an experimental case was shortly re- 
ported (Bray, 1921). A. J. Lotka considered a set of consecutive, auto- 
catalytic reactions in which the concentration of the first species is con- 
stant and showed that such a system exhibits undamped oscillations, i.e., 
the concentrations oscillate about a “central” value. This treatment has 
been generalized and extended by M. J. Moore (1949). In particular it was 
shown that the removal of the restriction that the concentration of the 
first reactant be constant leads to damping of the oscillations but in no 

* A portion of this work was performed while the author was in the Department of Physiol- 


ogy, University of Chicago, and was supported by a grant from the Dr. Wallace C. and Clara 
A. Abbott Memorial Fund. 
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case, as Lotka (1920) had shown, are the oscillations simple harmonic. 
The treatment was extended to systems which are open but otherwise of 
a simplified nature. The principal, theoretical content of the Moore (1949) 
paper was anticipated by K. F. Bonhoeffer (1948) who, however, con- 
sidered systems in which only two chemical species are involved. 

In all of the above cases, except that of Hirniak (Joc. cit.), the differ- 
ential equations which describe the system are non-linear, whether or not 
the system is open. In fact the open nature of the system contributes 
no non-linear terms when the usual flux laws are considered. M. F. 
Morales (1944) has discussed periodicity in pure diffusion systems in which 
chemical reaction plays no part. The flux law, based upon an extended 
form of the chemical potential, assumed by Morales is non-linear and re- 
sults in non-linear differential equations. In the following paragraph we 
refer to cases based upon /ineay equations. 

N. Rashevsky (1938, 1948c) discussed a case of an open system for 
which the chemical rate laws are the linear equations 


dx 

= = 1%, + Ay2X2 

F (1) 
oe 

a = 1X1 + Aq2X2, 


where 41, “2 are the concentrations of the first and second chemical species 
and the a;; are constants. A. M. Weinberg (1938) generalized the dif- 
fusion problem associated with this case and discussed solutions possessing 
certain symmetry properties. Rashevsky (1948a) has discussed more gen- 
eral solutions as well as the conditions under which the steady stale con- 
centration configuration exhibits spatial periodicity. The solutions dis- 
cussed by Weinberg (1938) and Rashevsky (1948a) depend upon certain 
restrictions on the diffusion and permeability coefficients but the solution 
of the diffusion problem is otherwise exact. By forfeiting exactness in the 
geometric description of the system and in the formulation of the diffusion 
problem, an approximation method (Rashevsky 1948c, 1940) allows the 
restrictive condition among the diffusion and permeability coefficients to 
be removed. Weinberg (1939) has solved the problem in this manner with 
the result that the frequency of oscillation is that of the fundamental 
mode whereas actually, in the exact solution (Rashevsky, 1948a), the 
oscillations are the superposition of a countably infinite number of fre- 
quencies. Rashevsky (1948b) has discussed the method by which this 
defect of the approximation method is to be removed. Finally, Rashevsky 
(1949) has discussed the possibility of centrally asymmetric concentration 
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configurations, which are subject to damped oscillation, leading to the 
division of a spherical cell and accounting for the damped pulsations which 
immediately precede cell division. 

In the cases discussed in the above paragraph the existence of periodic 
solution depends direcily (Rashevsky, 1938; Weinberg, 1939; Rashevsky, 
1948b, c) or in an important manner (Weinberg, 1938: Rashevsky, 1948a, 
1949) upon the properties of the coefficients in the linear equations, (1), 
for the chemical rate law. 

2. Purpose of the paper. It is the purpose of this paper to inquire for the 
restrictions set, by general chemical and thermodynamic principles, upon 
the coefficients of the rate equations for a general linear chemical system. 
The implications of these restrictions for the existence of periodic solu- 
tions are examined and discussed for closed and open systems. 

In particular it is the purpose here to show that if A is the matrix of the 
chemical rate equations [of which (1) is the second order case] the prin- 
ciple of conservation of mass and the requirement that the concentration 
be non-negative are sufficient to determine the signs of the elements a;;. 
These signs are such that in no second order case can the latent roots of A 
be imaginary. In the general, mth order case, the principle of detail balanc- 
ing (the law of entire equilibrium of G. N. Lewis) or the thermodynamic 
theorem that the (Gibbs) free energy is a perfect differential imposes ad- 
ditional restrictions upon the a;; such that A can be transformed, by a 
similarity transformation, into a real, symmetric matrix. With these 
properties of A established the reality and non-negativeness of the roots 
follow at once. The relations of these results to similar, special theorems 
proved by I. Opatowski (1945, 1946) and to the general linear problem in 
chemical kinetics are discussed. 

3. The closed, uniform, homogeneous system. By a linear chemical system 
we will mean a system consisting, in general, of m chemical species, X;, 


i = 1,2,...,, for which the formal chemical kinetic rate equations are 
i (t) = >) asses (2) , i=1,2, asain ay (2) 
j=1 


where x; is the concentration of the ith species, X;, the a;; are constants, 
and x, denotes the time derivative dx,/dt. This terminology, of course, 
bears no relation to the “physical” linearity of the system; for a straight- 
chain, branched or cyclic system may equally well obey (2). We require 
only that (2) be linear. If the species, X; ,are converted one into another 
this assumption of linearity means that the chemical reactions are first 


order. They are not necessarily unimolecular. 
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We assume, initially, that the species interact in any way whatever. 
Then, if one species, Xz, is to influence the rate of change of another, X;, 
it must do so by virtue of contributing a linear term, a, to the equation 
for «;. It will then be shown that the only real, physical interpretation of 
(1) is that the X, are converted chemically one into another. 

The most obvious restriction which the assumption of a closed system 
implies is that of the conservation of mass. This evidently requires, if the 
volume V of the system is constant, that there exist positive numbers, a, 
such that 


YS at=L> a;x,;=0. (3) 


In particular, if the «; be expressed as mass per unit volume, rather than 
moles per unit volume, the value of every a; may be taken as unity. 
The fact that a; > 0, for all z, follows from simple chemical principles: 
The a; can always be chosen as multiples of molecular weights of the X; 
when the x; are in moles per unit volume. In cases of simple stoichiometry, 
which are most likely to obtain when (2) is valid, the a; can be chosen as 
small integers which are ratios of the molecular weights. 

It is worth noting here that V cannot, in principle, remain constant 
unless the partial molal volumes of the X ; are identical. The formal laws 
of chemical kinetics are, however, based upon the assumption of dilute 
solutions in which case V is essentially the volume of solvent and for most 
purposes, including those of this paper, V may be regarded as constant. 
We are clearly assuming here that, in addition to being dilute, the solutes 
are perfect. 

From (2) and (3) it follows that 


Lad a= DL aas)es=0. (4) 


Since (4) must be valid for a// values of the «;, we have proved 
Statement I: The conservation of mass requires that the matrix, A, of 
the set (2) be singular 
| A] =1e,.3| =0; io 


The linear dependence of the rows of A, or of the determinant |a,,|, is 
expressible in terms of positive constants: 
Dd) aiais=0, the We be 


: (6) 
a> Oy dehy 2) esate 
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From (6) it is evidently impossible that all of the a;; be of like sign. A 
more restrictive statement regarding the signs of the a; can be derived as 
follows. The concentrations x; are, of necessity, non-negative. Further, 
their initial values are otherwise arbitrary. Let the initial conditions be 


BOY = 65.29" Bose s 2 hare go C7) 
where 6; is the Kronecker delta. Then, from (2), the initial rates are 
x; (0) = a i= 1, Ze see y N., (8) 


The implication of (7) and (8) is that if ain <0, i # m, then x,(t) < 0 
for some # > 0. The time, t, is of course to be reckoned non-negative. Thus 
(6), (7), and (8) lead to 
Statement II: The requirement that the «,; be non-negative demands 
that the off-diagonal elements of A be non-negative 
Gee 0, te G 3 ty 7 = 1, :2y5.., 2. (9) 


This requirement, with that of the conservation of mass (Statement 1), 
implies that the diagonal elements of A are non-positive: 


2 2 Sy ey pasal De eons (10) 
qi TAl 

In particular, if any a;; vanishes the column vector, in A, in which that 
element stands is the null-vector and the 7th species is an inert solute with 
respect to the others. 

Statements I and IT severely restrict the possible physical interpreta- 
tions of the set (2). A simple argument supports 

Statement IIT: lf (2) describes a chemical system in which mass is con- 
served, the only physical interpretation of (2) is that each of the chemical 
species, X ;, is stoichiometrically converted to, and formed from, the others. 

For (2), (9), and (10) state that the rate of change of the 7th species is 
the sum of one negative term and several positive terms. The negative 
term expresses that X; disappears at a rate proportional to x,. By (10) 
and (2), this rate of disappearance is, on a mass basis, numerically equal 
to the sum of the terms contributed by X; to the rates of appearance of the 
other species X;, (7 #7). The situation is most transparent when the 
special initial conditions (7) and initial rates (8) are considered. It is then 
evident that under the conditions that every «; is zero except one, viz., 
4m, the ith species (i ~ m) appears, (9), at a rate proportional to «» and 
the mth species disappears, (10), at a rate proportional to mn. Moreover, 
by (8) and (3), the rate at which X,, disappears is, on a mass basis, the 
same as the sum of the rates at which the X;, (7 # m), appear. 
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The term “‘stoichiometrically,” in Statement ITI, is used to imply the 
absence of catalysis. It has been previously noted (Morales, 1944) that 
the interpretation of (1), as employed by Rashevsky and by Weinberg 
(references in introduction), is that of catalysis without combination. 
This interpretation is, as Morales noted, plainly unrealistic and has been 
here excluded. Further, it has been shown that negative catalysis (inhibi- 
tion) cannot be expressed by the linear set (2); for a negative, off-diagonal 
element in A implies the existence of negative concentrations. (Statement 
II). 

It is well known from the theory of ordinary differential equations that 
the nature of the solutions of (2) is determined in general by the nature 
of the latent roots of A. Specifically, if the roots of A are distinct, the solu- 
tions are 


t= hem 4h, 2 ee sees (11) 
k=1 


where the gi depend upon the initial conditions and upon the ; which 
are the roots of the equation 


[A= FpeOs (12) 


Here J denotes the unit matrix of order ». If one or more pairs of the i; 
are complex, (11) can be written as a real, periodic function of ¢. It is to 
be anticipated, on the basis of physical intuition, that for a closed system 
the real roots and the real parts of the complex roots will be non-positive, 
for the x,(/) must be bounded for all ¢. According to (3), La.x,(/) = 
Ya,x} = constant is finite if the mass of the system is finite, and since 
every a; > 0, every x,(¢) is bounded. But this is impossible unless the real 
parts of the roots of A are non-positive.* Moreover, we are insured by 
(5) and (12) as a direct consequence of the conservation of mass of at 
least one zero root. It is to be noted that condition (5) could have been 
secured by insisting upon the existence of non-trivial, steady state solu- 
tions of (2). This, however, would not furnish the restrictions a; >0, 
a= 1,2,...,m, of (6). Consequently, while (9) could be deduced, (10) 
would not follow. The conservation of mass insures, by virtue of (5), non- 


* Although this reasoning is sufficient it should be noted that from a known theorem 
[Brauer, 1946; theorem (1) p. 389] it can be shown that the real parts of al] roots are non- 
positive. 

t The case of an r-fold zero root requires special attention, for such a root contributes to 
each xi(t) a polynomial of degree r — 1. Unless these polynomials are constants, the x(t) 
cannot be bounded. It is shown later in this paper that A can be diagonalized. Therefore if 
zero is an r-fold root the polynomials resulting therefrom are (positive) constants. 
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trivial solutions of the equations 
pom aay 0, t= 2,41. 4. 2., (13) 
. 


and in addition, by furnishing (6), affords restrictions (10). Im general 
the relations (5), (6), (9), and (10) are not sufficient to completely de- 
termine the nature of the roots,* \,;, of (12). 

By a well-known theorem (Aitken, 1948; Ferrar, 1941) the roots of a 
real, symmetric matrix are real. By another standard theorem (Aitken, 
1948; Ferrar, 1951) the roots of a matrix B are the same as those of A if 


VAV-=B, (14) 


where V is non-singular but otherwise arbitrary. We therefore seek a 
matrix V such that B is real and symmetric. If V is the diagonal matrix 


V= (65; vi] , (45:) 
then, from (14), 
_ | Gig 2s 
saa | v0; | ae 
and if B is to be symmetric it is required that 
eS Ths Se (17) 
vj UF; 
If v,, & = 1,2, ..., 2, can be chosen such that (17) is satisfied and every 


b,; is real it will be established that the roots of A are real and that the 
solutions, x;(¢), of (2) are non-pertodic. 

Equations (17), #(m — 1)/2 in number, may be considered as a set, 
linear and homogeneous in the » variables v, = vj. We require solutions 
such that every », < 0. The simplest manner in which to exhibit the re- 
strictions imposed upon the a;; by (17) is as follows. It zs assumed, for the 
present, that no aj vanishes. If a variable, v,, is arbitrarily chosen then 
there can be selected, from (17), 7 — 1 equations 


a,jV_, = AjrVj 5 a i Dome gett 
#~r 


in which v, appears. If v, is assigned the value unity, then the entire 
set of v; is given by 


(18) 


wp Te tei fe hae yg ts (19) 


*If n = 2, it follows at once that the non-zero root, \ = au + ax, is real and negative. In 
the case treated by Opatowski (1945, 1946), A is a continuant matrix, every a; = 1, and (6), 
(9), and (10) are enough to determine that every d; is real and non-positive. 
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If the v; from (19) are substituted into the remaining equations of (17), 
A; Vi = Ajj, G>tH=154, . ~ . 5) gy (20) 

there results the set of relations 
Aj, Aj jU jr = Vid ji dj (21) 


which are identically satisfied if any two of the subscripts, 7, 7, 7, are 
identical. 

For reasons immediately to become obvious we will refer to relations 
of the type (21) as cyclic relations. For brevity, (21) will be denoted by 
(ri 7) and a single member of the set will be called a 3-cycle. In general, a 
set of relations of the type (21) each of which involves o distinct sub- 
scripts, or o pairs of elements of A, aij. . . Gsi@ii = ji. « . Gee it, Will be 
denoted by (77 ...s5#), and a single member called a o-cycle. It is clear 
that, in (18), 7 can have any value except j and hence in (21), r can have 
any value distinct from 7 and /. In fact it is simple to show* that relations 


(21),:(% — 1) — 2)/2 = ( A ’) in number where Ge is the number 
of combinations of NV things taken s at a time, imply the set (77 &), 


(7, 7, 8 ¥ r], which has (G3 1) members. Therefore, of the total num- 


ber, ty of 3-cycles, (ee i are independent. It can further be 
shown that the 3-cycles (27) imply the o-cycles (ij... s#)., ¢ = 3, 4, 
...,%. Therefore of the total number, 2” — (Pe s — 1, of cycles 


there are (" a ’) which are independent and this is the number of 
3-cycles containing a selected subscript. 

Provided that (21) can be satisfied, the problem of finding a matrix 
V such that B, given by (14), is real and symmetric is mathematically 
solved. The elements of V are chosen as 2; = +/v;, with the v; from (19). 
The elements 6,; are then 

:.\12 
b5;= a,{ Stir) = (@;;4;;) 1/2 — bj. (22) 


a;,@,; 


That they are real follows at once from (9). We now consider the physical 
necessity of (21). 

If the system is in the steady state, equations (13) apply and serve to 
fix the ratios of the steady-state values of the x;. But for a closed system 
the steady-state is the equilibrium state (cf. discussion in Hearon, 1950a, b) 
and in addition to the requirement that every ¢; = 0 it is required that 


* This is discussed in detail in a forthcoming paper. 
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every process be balanced by its inverse. This is the familiar principle 
of detail balancing (e.g., Mayer and Mayer, 1947; Onsager, 1931). This 
additional requirement leads at once to (21). For, select from the system 
any three species X,, X;, and X;, denote explicitly in the usual way their 


mutual interconversion, 
X > X; 
ie a (23) 
Xj; 


and consider these transformations to be unimolecular (a, = 1 for all 2). 
It is required that the rate of the process X, > X; be equal to that of 
the process X ; — X,. According to Statement IIT and the argument which 
immediately follows it these rates are a;,x, and a, ; respectively. Similar 
considerations apply to the other two distinct pairs X,, X; and X;, X;. 
We therefore have as the conditions imposed by the principle of detail 


balancing, 
QipX p= AX; \ 


G,jX 5 = AjXy (24) 


AjgX 5 = AyjX;. 


The product of the three relations (24) is precisely (21). Stated otherwise, 
the first two equations of (24) give a relation between x; and «;. But this 
relation must be the same as the third relation of (24) and this is im- 
possible unless (21) holds. It obviously must be true, and it is not difficult 
to show, that if equations (24) hold for all r, i, and j, the x, so determined 
must satisfy (13). We will show this for the general case. If (13) is written 
as 

axtit D> aisez=0, i=1,2,...,0 (25) 


Fi 
then (10) and (25) give 


DS (as2s-= asaes)=0, RaMaD.. ates (26) 
a; 


Clearly, (26) is satisfied if 


A; 0;jX 7 = AjAziX; ty j=1, Phe cx Geo (27) 


But (27) can hold only if the relations (i 7 #) hold. For (27) gives 


SD eT ate el (28) 


xXi= 7 
1 Gs Gs; an Ok; 
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and also 


ea ee (29) 
ak Aki 
It is manifestly true that (28) and (29) cannot hold simultaneously unless 
the 3-cycles (ij k) are valid. From (27), o-cycles, ¢ > 3, can be derived 
in the same way. 

The presence in the system of cyclic chemical reactions, such as shown 
in :(23), demands according to the principle of detail balancing, cyclic 
relations, such as (21). In general, if it is possible to begin with X;, con- 
vert the species successively to ¢ — 1 other species, and return to X; 


X 3 X&;, 


| 


X <> + ee 


there must exist a o-cycle (jk ...5),. If every a;; is different from zero 
every species can be converted to and formed from every other. The 
system is then said to be completely cyclic, for every o-cycle, o > 3, then 
exists. If some of the a,; are zero the exact manner in which (17) are to be 
solved will be different in each case. It will now be shown that if zero ele- 
ments occur symmetrically about the main diagonal of A, (17) can be 
satisfied if certain o-cycles are satisfied. This is in contrast to the com- 
pletely cyclic case in which all 3-membered chemical cycles containing a 
selected species, X,, exist and (17) can be satisfied by (19) if all 3-cycles 
(rij) are satisfied. 
Let (17) be written as 
Ay Vi = Aj; , oe ea ee 
(30) 
i<j gn. 

It is assumed that if a;; = 0, then a;; = 0. Certain members of (30) will 
then be identically satisfied. For definiteness, assume that a1; = a, = 0 
for 7 = m, s, t. We exclude the trivial case (see Statement II) in which 
im, Gis, and aj are zero for every i + m, s, t. Accordingly if »; is assigned 
the value unity, (30) with i = 1 gives 


yj=—, jAm, s,t (31) 


and there are, among the remaining equations of (30), three which de- 
termine »;, 7 = m, s, t. Assume that there is an 7 < m, s, t such that the 
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members of (30) with 7 = 7,7 = m, s, tare not indentically satisfied. Then 
(32), 


Q,,; . - 
1) a 1=Mm, S,t, (32) 


Gr Q jp 

and (31) comprise a complete set of v;. The remaining members of (30) 
with z = r determine the 3-cycles (1 7), 7 ¥ m, s, t. Now for all & such 
that » is given by (31) and a,; + 0, 7 = m, s, t, the members of (30) 
withj = k,i = m,s, t, together with (32), determine the 4-cycles (177 k), 
1 =m, Ss, t. The members of (30) with j ¥ m, s, t, together with (31), 
determine the 3-cycles (17/), 7 # m, s, t. The extension to the general 
case is simple: In general, (31) may fail to determine any number of »,. 
The condition on (31) is then 7 # m,s,t,..., p. If there is an y such that 
(32) determines v;, 7 = m, s,t,..., p, the above argument goes through 
with the triple of indices, m, s, ¢, replaced everywhere by the arbitrary 
number (< mu — 2) of indices m, s, t,..., p. 

If there is no single value of r such that the members of (30) withi = r, 
j =m, S,t,..., p are not identically satisfied there are two major possi- 
bilities: 

(i) If values of 7, (say i = R;, 7 = m,s,t,..., p) can be found such 
that the members of (30) with i = R; andj = m,s,t,..., p determine 
v;,7 =m, 5S, t,..., p where R; may be different for each 7 and 7s not one 
of the set m, s,t,..., p the argument is as above, with obvious modifica- 
tions. In particular, in (32) 7 = m, s, t,..., p and r is replaced by R;. 
Then (30), with 7 = k,i = 1, s,t,..., p, together with (32), determines 
the 4-cycles (1 R;7k), 7 =m, s, t,..., p for all & such that », is de- 
termined by (31). 

(ii) There is always an r such that one of the v;,7 = m, 5s,t,...,B, 
is determined by (32). It is no restriction to suppose that v,, is so de- 
termined. Now it may be that the members of (30), when 7 has one or more 
of the values s, t,..., p, are identically satisfied unless 7 belongs to the 
set m, s, t,..., p. For example it may be that the only member of (30) 
which serves to determine y, is that with 1 = m,7 = s. Then 


— Gir4rmOms (33) 


Vs — Pi 
4714 mrdsm 


The remaining v;, 7 =¢...p may or may not be determined by (30) 
when 7 does not belong to the set m, s, t,..., p. If they are, these re- 
maining y; are determined as in (i) above. If they are not, these remaining 
y; are determined by formulae which are extensions of (33). For example, 
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if vy, is determined by (30) with i = s, 7 = #, then 


v= Q,ArmdmsAst (3 4) 
Aria mrtsmFts 


and if this situation repeatedly occurs, then finally 


oie 
ae U,4rm = (35) 
ApiGmr+ ++ Upl 


where the products such as @1-drm . . . @i can contain no more than” — 1 
elements of A. The equations, if avy remaining of (30), together with 
(31), (32), or its modification in (i), (33), ..., (35) determine various 
o-cycles where certain values of « may not be represented. 

Evidently, if there are no equations remaining in (30) there are no 
conditions (in the form of cyclic relations) upon the a;; and this will occur 
when there are 2 — 1 equations of (30) which are not identically satisfied. 
This implies that when no chemical cycles occur in the system there are 
2(m — 1) non-zero elements a,; (this result is easy to prove directly*). 
Further, these elements are independent, for the principle of detail balanc- 
ing imposes no relations among them. A notable special case of this kind 
is that in which A is a continuant matrix: In (30),7 = 1,2,...,”—1, 
and 7 = i+ 1. The system is a straight chain of consecutive reactions: 
X, = X,.s...& X, (cf. discussion, Section 7 of this paper). 

The above assumption, that if ¢;; = 0 then a;; = 0, can be justified 
on physical grounds: From (27), if the assumption were ot valid the 
equilibrium values of certain of the x; would be zero. This is contrary to 
experience and implies that certain equilibrium constants are infinite. 
Stated otherwise, it cannot be admitted that any chemical reaction is 
absolutely irreversible, in the kinetic sense (cf. discussion Hearon, 1950a), 
for by well known thermodynamic principles (see discussion, Section 7, 
this paper) this implies that the standard free energy change of the re- 
action is infinite. Of course, the assumption a,, = a,, = 0 amounts to 
the physical assumption that the transitions X, > X, and X, > X, do not 
occur. This is a familiar assumption which for practical purposes can be 
justified on the basis of experience although (Mayer and Mayer, 1947) in 
rigorous principle it is never valid and, by the same argument, no system 
has ever been observed to come to complete equilibrium. 


* Denote by N(n) the number of non-zero aj; for a non-cyclic system of species. Then, 
since each additional species contributes two additional a;;, N(n + 1) — NV (n) = 2. The 
solution of this difference equation is N(m) = 2 + w(n), where w(n) is periodic with period 
one. Evidently V(2) = 2. Therefore w(2) = w(n) = —2 and N(n) = 2(n — 1), which was to 
be proved. 
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The important features of the above arguments may now be sum- 
marized as 

Statement IV: The roots of the matrix A are real and the solutions x ;(#) 
of (2) cannot be periodic functions of time. The requirement of finite 
equilibrium constants, or standard free energy changes, for the allowed 
individual reactions of the system implies that zero elements, a;;, if any, 
occur symmetrically about the main diagonal of A. Then, the set (30) 
can always be solved in such a way that a matrix V, given by (15) with 
0; = V/v;, converts A, by a similarity transformation, into a real, sym- 
metric matrix. For, if there are only m — 1 equations the »; are given by 
equations such as (31) through (35) and are, by (9), necessarily positive. 
If there are more than  — 1 equations the set (30) is consistent and 
equations such as (31) through (35) represent acceptable solutions if cer- 
tain cyclic relations are satisfied. That these relations be satisfied is de- 
manded by the principle of detail balancing. 

It is now simple to prove that the roots of A are non-negative. In order 
to achieve this, and for later purposes, we prove the following 

Theorem: Consider the real, symmetric matrix C. Let there be real 
quantities yy; and p; such that 


m0, k=1,2,...,0 (36) 
Do 1rbre= 02, S=1,2,...,%. (37) 
Then if 
is Oo whiied 952 ointpm Vedi fin (38) 
ViYi 
and 
eee eee dg ee (39) 


where at least one inequality in (39) holds, the matrix C is negative definite 
and, if the inequalities in (38) and (39) are reversed, C is positive definite. 
If the equality in (39) holds for every i, C is semi-definite. : 
Proof: Evidently, if the equality in (38) holds for all distinct 7 and 7, 
the theorem is trivial, for C is then diagonal and the ratios p;/7: are, by 
(37), the diagonal elements ci:. We exclude this case. 
The quadratic form Q(y, y) associated with C can be written as 


20 (4; y) a De cust 2 DY De Cif ViVGt+ >> CiiVj° (40) 
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From (36) and (37), the diagonal elements of C are 


om one nto, b= 1p Dont oe (41) 


k rk 


If (41) is substituted in (40) and the symmetry of C is explicitly invoked, 
there results 


(y, 9) = ae pi yi AD De ooo (42) 


Under the conditions of the theorem the double summation is non-nega- 
tive and the single summation is non-positive for all real sets of values 
of the y,. If not every p; is zero Q(y, y) is negative definite. Otherwise it is 
negative semi-definite. The situation is equally obvious when the in- 
equalities in (38) and (39) are reversed. 

As discussed above, the matrix B determined by (41) is real and sym- 
metric if V is properly chosen. Further, by (17) and (9), b:; > 0,7 #7. 
There exist real quantities 8;, all different from zero and of like sign, such 
that 


D>) bibis=0, G=1,2,..~,0 (43) 


since, by (6) and (17), the choices 8; = a,/v; meet these conditions. 
Applying the theorem to the matrix B, condition (38) is satisfied, the 
equality of (39) holds for all 7, and B is negative semi-definite. Thus we 
have 

Statement V: The latent roots of the matrix A are non-negative. We 
will refer to a matrix which possesses the properties described in Siate- 
ments I through V as an admissible rate matrix, or simply an admissible 
matrix. 

4. The open, uniform system. It will be realized that if the rates of pro- 
duction (per unit volume) of the chemical species of an open system are 
linear combinations of the concentrations, the matrix of the coefficients 
of these linear combinations must be an admissible matrix. For the sys- 
tem becomes closed if it is required that the flux of every species vanishes 
at the surface of the system. It is easily seen that the equations of con- 
tinuity then reduce to (2), if in (2) every x; is replaced by the (spatially) 
mean value of the concentration of the ith species. The arguments neces- 
sary to establish the properties of an admissible matrix can be carried 
through without further change. The operation of closing the system (e.g., 
by surrounding it by an impermeable envelope) cannot effect the elements 
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of the matrix if we make the usual restriction that only isothermal sys- 
tems are considered. 

Consider an open system for which the spatially uniform internal con- 
centrations are x,(¢) and the external concentrations, spatially uniform 
and constant with respect to time, are C}. Then the set (2) is replaced by 


i= > (a:;— pis) %j+ 2) PisCd, (44) 


where the p;; are the products of the generalized permeability coefficients 
(Hearon, 1950b) and the surface-volume ratio of the system. If the simple 
laws of diffusion are assumed to apply, then P is the diagonal matrix 
[5;;p:;| and if A is an admissible matrix the roots of the matrix A — Pare 
real and non-positive. A matrix V can be found, precisely as discussed in 
Section 2, such that V(A — P)V-! = B — Pis real and symmetric. The 
permeability coefficients are non-negative. Therefore P is positive definite 
or semi-definite and B — P is negative definite. 

The time course of the approach of the «; to their steady-state values is 
governed by the solutions of the homogeneous equations associated with 
(44) and we have shown that these solutions cannot be periodic if P is 
diagonal and A is admissible. 

5. The open, non-uniform system: approximate methods. If the simple 
laws of diffusion are assumed and the diffusion equation is approximated 
in the manner suggested by Rashevsky (1940, 1948c), then (44) is re- 
placed by 

{i= De (ay = Biel) FANG, (45) 


where £; is the mean concentration of X,, A, is a constant, and Cj is the 
unperturbed external concentration. The argument of Section 4 applies: 
P is replaced by the diagonal matrix [6;;/A;] and A; > 0. 

Therefore accepting the approximation method of Rashevsky, we 
have only to insist that A be admissible to insure that the #; are non- 
periodic functions. 

A refinement of the standard approximation method has been suggested 
(Rashevsky, 1948b). The system is divided into regions and the diffusion 
equation for each species is approximated in the manner of the House- 
holder tri-axial cell (Householder, 1942). If by C; is denoted the mean 
concentration of the ith species in the jth region, the differential equations 


are 
dt Nw Wen ee 4a 
qe ra anCi+ > mn (Ce- C3) ; (46) 
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where it is understood that there are p regions and m species and (46) is 
a set of np = N equations. Here, mj, is a parameter characteristic of 
regions j and r and of the substance X;. The zeroth region is the environ- 
ment and M}, will depend also upon the permeability of the surface of the 
system. The quantities Cj are the unperturbed external concentrations 
and are to be regarded as constant. Clearly, if regions j and & are not con- 
tiguous then m}, = mj; = 0 for all 7. It is not difficult to see that if new 
variables y, are defined by C} = yQG—-n+i,t = 1,2,...,%,7 = 1,2,..-5,B, 
then the differential equations for the y,, k = 1, 2,...,N, can be 


written as* 
y=Ryt Y. (47) 


Here, R is the partitioned matrix 
Ei My My...Miy 


Mra Ey Mo; . - - Moy 
R= ; ‘ y : (48) 


Moy Magia oat te 
where the sub-matrices, square and of order n, are 
E,= A— [d:jmi), (49) 


M;;= [5,.mi5] , (50) 


the quantities m; being defined as 
- P - 
r=0 


By y is denoted the column vector of the y, and the constant column 
vector Y is 
VY =M jy , (52) 


where Cy is the column vector of the Cj. Now it is possible to construct a 
partitioned matrix T = [6,;7,] such that TRT— is real and symmetric, 
provided that A is admissible. The sub-matrices on the diagonal of TRT— 
are TE ,T-} and, by (49), they are symmetric and real if 7; = {;V, where 
fi is any real, scalar factor and V is the matrix discussed in Section 2. The 
sub-matrices in the off-diagonal positions of TRT— are V ;M;;V3'. Since 
the M;; are diagonal, the product TRT— is symmetric if T;M hg. 

* It is to be noted that in equation (3) of Rashevsky (1948b) the signs of the flow terms are 


incorrect and that in his equation (5), which is equivalent to our (47), the constant terms have 
been omitted. 
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= T;M;,T;", or, since diagonal matrices are commutitive, if 
SiM i; = $3Mj: « (53) 


If the volume of the 7th region is 7,, it is clearly a physical necessity that 
Tm; = 7 smi;;, for all i, 7, and k, and hence from (50) 


7iM;;=7;Mj; , 1, I= 1X2, cans 1A]. (54) 


According to (54), if fe = 7x, k = 1, 2,..., p, then (53) is satisfied. 
Thus it has been proved that the roots of R are real. The vector y can- 
not be a periodic vector. Straightforward application of the theorem of 
Section 2, with due respect for (49), (50), and (51), shows at once that the 
roots of RF are negative. 
6. The open non-uniform system; general case. If the general case 


F 0C 
dS Diwe;+ SY a —— ag bap eae | (55) 
7 7 


is considered, a definite answer to the question of the periodicity of the C, 
can be supplied only if the matrix, [Dj,], of the diffusion coefficients is 
diagonal. It can then be shown that 7f [aj] is am admissible matrix the 
eigen-values of the set of Sturm-Liouville equations to which the Laplace 
transform of the set (55) gives rise are all real. This implies that the C; 
cannot be periodic functions. While this demonstration depends directly 
upon the properties of an admissible matrix as defined here, it otherwise 
involves considerations very different from the subject matter of this 
paper. These results, which in no way depend upon the geometry of the 
system, will be included in a forthcoming apper (Hearon and Sangren, 
1953). It may be noted in passing that it would indeed be a severe indict- 
ment of the approximate method (Rashevsky, 1940), and the refinement 
thereof (Rashevsky, 1948b), if (55), with Di; = 0, k ¥ j, gave rise to re- 
sults basically different from those of Section 5. 

The nature of the steady-state solutions of (55) is easily seen in the 
case of spherical symmetry when Dj; = 0, k #7. For the steady state, 
(55) becomes 


DVYC+ AC=0, (56) 
where D is the diagonal matrix of the diffusion coefficients and C the 
column vector of the C;. The transformation 

C=Gé 
G=V"'5, 


(57) 
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where V is the matrix discussed in Section 2 and S is the matrix which 
diagonalizes the symmetric matrix VAV~, gives 


DVE+AE=0, (58) 


where A = [5,,;A,] and the ),, the roots of A, are non-positive. The solu- 
tions of (58) are 


- 1 nee (59) 
f= Ki, 


where e; = \/—NA,/D: and the roots are labeled so that \; = 0. Therefore, 


C; = pas gis 5 
; 


sn By (60) 
Sl ; 

= Kit >» gis K; re ~ 
7=2 


7 


The special solution, &, of (59) could have been anticipated. For if A is 
singular, some linear combination of the C; is a harmonic function. If A 
is an admissible rate matrix this linear combination is aDC where a is the 
row vector of the a, of (6). This statement, of course, does not depend 
upon D being diagonal. 

7. Discussion and summary. From the standpoint of conventional 
chemical kinetics the arguments upon which Statements I and IT are based 
are unnecessarily involved, for if the rate matrix for a set of chemical reac- 
tions be written down the contents of these statements can be verified by 
inspection. The arguments are such, however, as summarized in Statement 
ITT and the discussion thereof, that if a formal description is attempted in 
terms of linear equations then the only possible interpretation is the 
stoichiometric conversion of the X; one into another. 

The physically simple condition of the conservation of mass is mathe- 
matically a far-reaching one, for this condition alone is sufficient to estab- 
lish that |A| = 0, that for any given j the a,; cannot all have the same 
sign, that the real parts of the roots are non-positive (see first footnote) 
and that there are no pure imaginary roots (see below). The fact that con- 
centrations are positive is sufficient to fix the signs of the a,;,7 ¥ ;. These 
two conditions are sufficient to insure the reality of the roots for the case 
n = 2 or for any case in which there are only 2( — 1) off-diagonal non- 
zero a,;. In fact, if » = 2, the conservation of mass completely deter- 
mines the character of the roots for one root is zero and complex roots 
must occur as conjugate pairs. 

In general, the principle of detail balancing must be invoked in order to 
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prove the reality of the roots. That they are non-positive then follows 
from the conservation of mass by physical reasoning, by the theorem of 
A. Brauer (1946) or by the theorem of Section 2. A necessary condition 
for applying the principle of detail balancing is that there exist non-trivial 
solutions of equations (13). As noted, this is insured by the conservation 
of mass. It might be questioned that these solutions actually correspond 
to stationary values for the x,, ie., if pure imaginary roots occur the 
«,(¢) might oscillate in such a way that solutions of (13) exist although 
the system is not actually time independent. If the units of x, are chosen 
such that every a; = 1, the theorem of Brauer (Joc. cit.) may be stated as 
follows: Every root of A lies in one of the circles in the complex plane 
with centers at a;,; and radii > a;;. Equations (10) then show that these 
71 

circles lie in the left half-plane and are tangent to the axis of the imagi- 
naries at the origin. There are, therefore, no pure imaginary roots, the real 
parts of the roots are non-positive, and the x,(t) approach true steady 
state values ast— ~. 

From the practical point of view the argument leading from (30) to 
(31)-(35) perhaps requires additional comment. The hypothesis upon 
which the argument is based means physically that if the transition 
%,—>%, occurs (i.e., ad. > 0) then the reverse transition occurs also 
(a,, > 0). The value of a,, may be extremely small and for certain prac- 
tical purposes it may be permissible to neglect @,; with respect to ds, or 
with respect to any other element of A. Mathematically, the hypothesis 
requires only that a,,/a,, be finite; the actual numerical value is irrele- 
vant. It is well known that for practical purposes certain reactions may be 
considered kinetically irreversible. It cannot, however, on this basis be 
asserted that the hypothesis is invalid. 

The history of the cyclic relations of Section 2 is of interest. Over fifty 
years ago R. Wegscheider (1901) published a remarkable paper in which 
it was pointed out that in certain cases the mass-action expression as ob- 
tained from chemical kinetics does not agree with the classical law of 
mass-action as derived from thermodynamics unless a certain relation 
among the rate constants can be assumed. This relation corresponds pre- 
cisely to a 3-cycle as defined in Section 2. This situation was long known 
as “Wegscheider’s Paradox.” Nearly a quarter of a century later GN. 
Lewis (1925) showed that this “paradox” could be resolved at once by 
the principle of detail balancing. This seems to be the first instance of the 
application of the principle, well known in statistical mechanics, to purely 
chemical considerations or to physicochemical processes such as phase 
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changes. Six years after the appearance of the Lewis paper, L. Onsager 
(1931) discussed the 3-cycle as an example of the now famous Onsager 
reciprocal relations. He stated that such relations are not a thermody- 
namic necessity. However, the free energy chanage for conducting a 
chemical reaction around a closed cycle is of necessity zero. Further, this 
free energy change is equal to the standard free energy change for the 
process. If this quantity is taken to be —RT/nK, where K is the equi- 
librium constant for the cyclic reaction, it is clearly necessary that K = 1. 
But K is the product of the equilibrium constants for the individual steps 
in the cycle and K = 1 is just a o-cycle, as defined in Section 2, if there 
are o steps in the cyclic reaction. The cyclic relations could have been 
derived in this manner. This argument requires however the usual as- 
sumptions regarding the relation between an equilibrium constant and 
certain linear combinations of the standard chemical potentials as well as 
the relation between the equilibrium constant and the forward and re- 
verse rate constants. While these assumptions are universally granted, the 
latter may be said to lie outside the framework of classical thermody- 
namics. 

The relation between the present work and certain previous investiga- 
tions of periodic reactions requires little discussion. The case of Section 4 
is a generalization of that treated by Rashevsky (1938, 1948c). The first 
case of Section 5 is a generalization of that treated by Weinberg (1939), 
the second case a generalization of that treated by Rashevsky (19480). In 
the more general case treated by Weinberg (1938) the characteristic equa- 
tion is |A — \D| = 0, where D is the diagonal matrix of the diffusion 
coefficients and A and D are of order 2. It follows at once that if |A| = 0 
one root is zero. The other is necessarily real, it is \ = @yn/Dy + 22/ Doo 
and therefore negative. In this case, then, the conservation of mass is 
definitive as regards the reality of the roots. 

It is worth noting here that although the assumption of linear differ- 
ential equations is very restrictive for general chemical systems there is a 
large class of cases in which this is valid. The concentrations of isotopical- 
ly tagged species will obey linear equations provided that they are present 
at tracer level. If the system is stationary with respect to the normal 
species the coefficients in these equations are constant. The formalism of 
this paper then is applicable to isotope kinetics. In particular the second 
case of Section 5 corresponds to the diffusion of 2 species between p com- 
partments with general chemical reaction in each compartment. 
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The necessary and sufficient condition is given for integers to be the score structure of a 
society with a dominance relation. 

A proof is also given for a theorem showing that there are members who dominate every 
other member either directly or indirectly through a single intermediate member. 


The score structure of a society with a dominance relation (Landau, 
1951) was defined as a set of m integers Vn = (vo, 01, . . . , Uni), Where 2; 
denotes the score of the ith member, i.e., the number of members domi- 
nated by the zth member. Here we are giving the index 7, the range 0 to 
n — 1 instead of 1 to as before. It is also convenient to modify the 
definition of a dominance structure matrix; this is now defined to be a 
matrix B, with elements b;;, where.b;; = 1 if 7 dominates j and 6;; = 0 
otherwise, so that b;; = 0 and b;; = 0 if b,; = 1. Then the 2; are the row 
sums of the dominance structure matrix 


where the rows and columns are numbered from 0 tom — 1. 

We prove the following theorem. 

Theorem. The n non-negative integers (vo, 11, . . . , Yn-1) will be a score 
structure if, and only if, 


eS eee for R= 1; 2, 0.., 0-1, (1) 
rs 2 
where DEF is the sum of any & of the 7;, and 

k 


eal n(n—1) (2) 


Cie at ame 


1=0 
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Proof. The necessity for (1) and (2) can be seen from the interpretation 
given previously (Landau, loc. cit.) of the score structure as the scores of 
the players in a tournament consisting of a single round robin, with no 
games ending in draws. Any & of the players taken together must have 
scored a total number of wins at least equal to the number of games they 
played with each other, which is k(& — 1)/2. This is all that is required 
by (1) and (2). 

Formally, the necessity can be shown as follows. Strike out any » — k 
rows and the corresponding columns of B. Then for the resulting row 
sums, v,, we have v; < v;. Each of the k(R — 1)/2 pairs of elements sym- 
metric with respect to the principal diagonal consists of a zero and a one, 
and the diagonal elements are zeros. So 


See —— 


This proves (1), and (2) is necessary by definition. 

The proof of sufficiency will be by induction. For x = 2, conditions 
(1) and (2) give the score structure (0, 1). Assuming then that for any 
m non-negative integers V, = (vo, 11, . . . , U1) Satisfying (1) and (2) there 
is a dominance structure matrix with these v; as its score structure, we 
must prove the same for any Vasi = (v0, 11, . . . , Un) Satisfying (1) and (2). 

It is convenient to assume that the v; are arranged in ascending order 

Ups ee ve, Os (3) 


so that (1) is equivalent to 


Rik=T} 
Vv —— 


oon ; re bens 0% (4) 
1=0 
For Vas3, (2) is 
> = ee (5) 
i=0 
so that from (4) and (5) 
(n+1)n mn(n—-1) 
oe en en nN, (6) 
If v, = n, then V, = (v, v1, ..., Unt) satisfies (1) and (2). If to its 


dominance structure matrix, B,, we add an (m + 1)st row and column 
with 
bg b> <for gee a pee ed 


b,=0, for 4m 1 ene 
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the resulting matrix, B,4:, will be a dominance structure matrix with 
Vn as its score structure. 

Now let v, < ”, and write d = n — v, > 0. It will be shown below 
that: (A) there are at least d values of i for which v; exceeds its hierarchical 
value, i. e. , 0; = 7+ e; with e; > 0. We denote these by 2;. The d largest 
of the 2, are reduced by subtracting one from each of them, giving 
v; =v; — 1 for the d largest v;, and the other 2; are not changed. Let the 
resulting set of modified v;, for? = Oto m — 1, be Va = (ao, i1,. . . , Tn_1). 
It will be shown that: (B) the 4; satisfy (1) and (2) and hence by the in- 
duction hypothesis V, is a score structure. Now we can augment its 
dominance structure matrix by adding an (2 + 1)st column with the ele- 
ment 1 in the rows from the 2,’ and 0 in the other rows, and also an 
(w + 1)st row with 0 in the columns of the v;’ and 1 in the other first n 
positions. The resulting matrix has Vayi = (v0, 11, .. . , 2n) for its score 
structure, proving the sufficiency of (1) and (2). 

To prove (A), we first note that when 7; = v; = 7 + e;, the e; — 1 fol- 
lowing scores 0;41, . . - , Vize;_; also exceed their hierarchical values. This 
is because the ordering givesv;;; > 0; =i+e;>i+jforj =1,2,..., 
Ce. tis 

Now suppose there are D of the v;. Let the v; in sere cibg order be 

< vj, <...< vi, where % is the index of the &th v’. Then e,, < D, 


. <D—1 and generally e, < D—k nr 1, because otherwise there 


aan be more than D — & of the a’ after v;,, from the statement of the 
preceding paragraph, and then the total number of v’ would exceed D. 
From this we have 
= D+1)D 
SS a oe cee eee (7) 
k=1 


We can now prove that D = d, which is the statement of (A). From 
n—d=%2 i he Un—a+1 3 (8) 


we see that none of the d largest of the 2; is a v’, also 


; vj<d(n—d). (9) 


i=n—dtl1 


Since all the v’ are among the first 7 — d + 1 of the v,, using (7) we have 


uc Si +45 epee 8) ee ea ta (10) 
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Adding (9) and (10) 


D> SF [2d (n= d) + (n— 4) (n—d +1) +D(D41)1 
TMS pnts ty ed ee De 
but from (5), 


So 


i=0 2 
so that 
D(D+1) 2 d(d+1) 
or 
Deas (11) 
To prove (B) we show that (4) is satisfied by the ,. First when 
k>n—d+1, (12) 
we have using (9) 
Sie n(n $2) 
>» vanes Us mn eee Cae (n—k+1) (n — d) 
= 4) 4 a4 (nb) (d-* 5) 
oC Be De 


because of (12). Hence the 4,;, obtained from the 7; by subtracting 1 from 
d of the v; satisfy (4) fork >n —d +1. 

Now when k < n — d + 1, (4) is satisfied by hypothesis if none of the 
v, are in the range 0 <7 < k — 1. Suppose that d, of the v’; are in this 
range with 0 < d,; < d. Then using (9) 


k—-1 n nd 
: y= > ve y i v; 
i=0 i=0 i=n—d+1 i=k 


(13) 
n--d D 4 
= Be ad (wd) »S ei, 


=k k=D—d+d,+1 
where the last sum is over the e; of these v; with k <i <n —d. 


There are exactly d — d, of the v; in this range so that by the reasoning 
used for (7) 


D 


So, < aw da hty 
k 2 


k= D—d+d,+1 


a 
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Then from (13) 


k=1 
> Porat tt) —d(n—d) 
i=0 


_(m=d+h) (n—d-k4+1) (d-dh) d-dh, 41) 


RA th) 
a 


AS rele (14) 


So that (4) is satisfied by the 3. 
Now (4) is equivalent to oe if 
oe (15) 
Since the v; were in ascending order, (15) will hold for the a unless the 


smallest v’’, say 2, is equal to 2,_,; which is not reduced. Then 9,_; 
=%,_1 > i,. However, since i, = 7, — 1 >s — 1 and 


Luz Sieg? ioe, 


so (1) is satisfied. If there are more than one 2; equal to v, there will be 
more than one departure from the ascending order but it is clear that, by 
a calculation like the one just given, (1) will be satisfied. Since (15) holds 
for all other 4, this proves that the 3; satisfy (1). 

Equation (2) holds for the 3; because 


n—1 n 
SS 5:= Do u-—2,-d 
i=0 i=0 


n(n+1) _n(n—1) 
= the (n-d) -d =". 


This completes the proof of (B) and of the theorem. 
From (1) and (2), we can get other inequalities for the »;. For example, 


FOE, (Mo, By. ae 5 Pra) 


es e(n-*F*), (16) 
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and if the v; are in ascending order 
n+i-—1 


5 (17) 


s5uS 

In an unpublished report F. E. Hohn stated the theorem that in a so- 
ciety with a dominance relation there is at least one member 7 such that 
for every member 7 + 7, either 7 dominates j or there is a member k such 
that i dominates & and k dominates 7. This result is due to H. E. Vaughan 
(1952) and is a consequence of a theorem of his on sets in which an anti- 
symmetric relation is defined. We give a simple proof of this theorem 
which also shows that this property is possessed by every member whose 
score is equal to the maximum score. Of course, other members with 
smaller scores may possibly have the same property. 

Theorem. If v; is equal to the largest score obtained by any member, 
then for any j + 7 either 7 dominates j or there is a member & such that 7 
dominates k and k dominates 7. 

Proof. Suppose the largest score is v = v;. Then 7 dominates v members, 


call these 71, j;,..., 7,,and 7 is dominated by the remaining m — 1 — 
members, j;, 73, ---,J,—1-». We must show that each of the 7” is 
dominated by at least one 7’. If this were not true for say 7;’, then j{’ would 
dominate j;, 73, . . . ,7, as well as 7, so that v;;" > v + 1. But vis the largest 


score obtained by any member. Hence the theorem follows. 

In terms of the dominance matrix, the theorem states that for every 
j + 1, either b;; = 1 or there is a k for which 6.,; = 1. Then if we use 
ordinary matrix multiplication, the theorem states that in B? + B the 
off-diagonal elements of the 7th row are all positive. 

This work was aided by a grant from the Dr. Wallace C. and Clara A. 
Abbott Memorial Fund of the University of Chicago. 
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It is demonstrated that an explanation of the small radius effect or the so-called sigma phe- 
nomenon may be obtained by noting that one of the effects of the presence of suspended par- 
ticles in a flowing fluid is to increase the velocity of flow near the wall over that existing in the 
absence of particles. This effect may be considered equivalent to relaxing the boundary condi- 
tions at the wall. 

An expression for the viscosity is compared with data and the fit is found to be good. 


There have been a number of observations (Bayliss, 1952) that for 
blood flowing through narrow tubes the apparent viscosity of the blood 
depends upon the radius of the tube. For tubes of radius 0.015 cm. or less 
the viscosity decreases as the radius decreases. Of course, for extremely 
small tubes it becomes impossible for the blood to flow at all, the flow 
being blocked by the red blood cells. However, when the tube is large 
enough to permit flow, the above phenomenon is apparent and, indeed, is 
quite marked. 

That the anomaly is due to the presence of the blood corpuscles is 
shown in a direct fashion by its absence upon their removal. 

The effect is common to a variety of fluids in which there are suspen- 
sions of small, insoluble particles, such as talc suspensions, clays, paints, 
and mud (Scott Blair, 1938). In such rheological work the phenomenon 
has been known as the sigma effect. 

An interpretation of this phenomenon has been given by F. J. Dix and 
G. W. Scott Blair (1940). Their treatment consists of assuming that flow 
takes place in layers, i.e., in a region 7 to r + 6 (r being the radial co- 
ordinate) the fluid has a uniform velocity, 1; in the region r + 6 tor + 26 
the fluid again has a uniform velocity, %., etc., where % < %, 03 < 2, etc. 
The picture is essentially one of telescoping hollow cylinders slipping 
through one another. In addition to the parameter 6, their treatment uses 
the concept of a coefficient of friction # which measures the stress neces- 
sary to move one layer across another when the layers differ by one unit 
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of velocity. The product F'6 corresponds to what is ordinarily considered 
the viscosity of the fluid. 

For our purposes the essential result of their work consists in the 
derivation of the expression 


te = (142 (1) 


where 7 is the apparent viscosity when a tube of radius a is used and 7, 
is the viscosity when a very large tube is used. The viscosity in the ac- 
cepted sense of the term is, of course, 7,. 
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Ficure 1. Viscosity of human blood. Data from Fahraens and Lindqvist (Joc. cit.). The plot 
of 1/n vs. 1/a should be a straight line according to equation (3). Viscosity is measured rela- 
tive to a fixed standard and the units shown have no absolute significance. 


L. E. Bayliss (Joc. cit.) has compared equation (1) with observed data 
and has found that the fit is good. 

In this note an alternative approach to the problem will be attempted. 
We shall first analyze, in a qualitative way, the influence of suspended 
particles on the velocity profile of a liquid flowing in a tube. 

In the absence of particles the velocity is parabolic. A particle in a fluid 
is pushed by the fluid behind it, is dragged by (or is dragging) the fluid at 
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its sides, and is pushing the fluid in front of it. It is clear that if the 
velocities on the two sides of the particles differ, then the presence of the 
particle will tend to minimize this difference. Now a particle near the cen- 
ter of the tube will hardly affect the profile since the velocity gradient in 
its absence is quite small. On the other hand, a particle near the side will 
tend to increase the velocity of fluid at the side. Therefore, if the particle 
size is not much smaller than the tube size, say of the order of + or so, 
then the velocity close to the wall will be much larger than if no particles 
were present. In other words, there will exist a layer of fluid, say of thick- 
ness /, at the wall across which the velocity changes abruptly. Now if / is 
small we may then, for all practical purposes, say that the fluid slips at the 
wall. We have now translated the original problem into the following 
problem. What is the flow of fluid through a tube if the fluid slips at the 
wall? H. Lamb (1932) treats this problem using as a boundary condition 
at the wall 


Velocity = Constant X Stress. (DS 


This boundary condition is correct for our problem if / is small enough 
so that the velocity may be considered a linear function of distance 
throughout the boundary layer. Clearly the acceptance of (2) involves no 
assumption that has not already been made. 

Call the constant in (2) £. 

Lamb then shows in a straightforward fashion that the amount of fluid 
flowing per second through the tube is given by 


where AP is the difference in pressure across a tube length L. Evidently 
this result is equivalent to stating that the fluid has an apparent viscosity 
n, dependent upon the tube diameter, where 


eee ie dy (3) 


n No a 


Equation (3), for our problem, expresses the small-radius effect or the 
sigma phenomenon. This equation is compared with the data of R. 
Fahraens and T. Lindqvist (1931) in Figure 1. 

This investigation was supported by a research grant H-627C from The 
National Heart Institute, of the National Institutes of Health, Public 


Health Service. 
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The solution for the spatial distribution of ions in a Donnan equilibrium has been given by 
J. H. Bartlett and R. A. Kromhout (1952). The present note gives an explicit solution for the 
case in which the length of the region containing the membrane is large; in biological situations 
this requires only that the length considered should be greater than a few hundred Angstrom 
units. The Donnan equilibrium may be considered to be a special case of a situation in which 
forces other than electrical act upon the ions; in particular, it represents the case in which only 
one ion is acted upon and the energy difference on the two sides of the membrane is infinite. 
An expression is given for the difference in energy of the ith in terms of the electrical potential 
and of the ion concentrations. As an illustration, the results are applied to nerve membrane 
potentials. 


In a recent paper Bartlett and Kromhout (Joc. cit.) have given the solu- 
tion for the spatial distribution of the potential and of the concentrations 
of ions for the situation in which a solution of ions is separated into two 
regions by a membrane which is impermeable to one of the three kinds of 
ions present. It is the purpose of the present note to give the solution ina 
simpler form for the case in which the length of the vessel is sufficiently 
large. Since the linear dimension of the region considered does not need to 
be much greater than 100 A for this approximation to hold for biological 
material, this is not a very great restriction. The results are then gen- 
eralized to the case of ” ions in which nonelectrical forces are acting upon 
the ions within the membrane which separates the vessel into two regions. 

Following Bartlett and Kromhout, we suppose that the ions Ht, CI, 
and R+ are present on the left side of the membrane at concentrations 
C1, C2, and cs respectively, but that only H+ and Cl are present on the 
right side. We shall consider a situation in which the concentrations are 
small enough so that the activities can be taken to be unity. The effect of 
osmotic pressure will be neglected. Let F be the Faraday constant, 
2.895 X 10" e.s.u. coulombs per mole, R the gas constant, 8.315 X 10’ 
ergs per degree, T the absolute temperature, D the dielectric constant of 
water, and cio the concentration of the 7th ion in moles/cc. at that point 
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at which the electrical potential V is put equal to zero. Let 


ests pa ATE Cm 


Ris: Der 2) 


Let z; be the charge of the ith ion and « the distance from the membrane. 
Since the ion concentrations are distributed according to 


f= bye « (2) 
Poisson’s equation for the potential V in the one-dimensional case may be 
written (V’ = dV/dx) as 


VP == ewe” — (Cio + Cao) Pie SE ¥ 03 
(3) 


yr" Saget”, al epee iyie ai Be 
C20 


At a point far away from the membrane V”’ must approach zero. Let 
V = 0 at x = —o and denote by V,, the potential at x = ~. We then 
obtain the following conditions from (3): 


Cay = Cio t Cop 5 (4) 
Coe ™ = eye ™. (5) 


If we introduce U = V — V,, and k’ = ke*" = into the second expression 
of (3), then the resulting equation is of exactly the same form as that of the 
first expression of (3). Thus we shall only discuss the solution of the first 
expression of (3). 

If both sides of (3) are multiplied by 2V’, one can see that the integral 
can be written as follows, since V’ = Oatx = +o: 


, 2k a =a b? a —a Vi OF 
(V2) Pe Cet plete! may ee al giana ID HG ee WhO ee) 


We may take the square root of both sides, separate the variables and 
integrate. Note that if sinh (aV/2) is introduced, the solution may be seen 
by inspection. If 


5 ee b et Van/? (7) 


and if K, and Ky are integration constants, the solution may be written as 
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follows: 
2 LBs eo 
V= . 
a STE Kem z<0,| 
Pec keet = er 
ae 'l6g sais Se Viens x>0Q. 
1— Kye > ) 


If, for example, V,, < 0, then 1 > K; > 0,1 > Ky >0,andd>0. 

The constants K, and Ky are determined from the condition that the 
potential and its derivative are both continuous across the membrane at 
x = 0. They are found to be given by the following expressions: 


144 4V2V1+42?-1-4 


ida ot Ate A: ie 


+24 V/2V1+4+ A?—1-—3A? 
K,= 48 ; (10) 


where 
Had (11) 


That the electric field is continuous across the membrane can be seen 
as follows. The space integral of the charge must be zero. Hence, from (3), 
the integral of V’” must vanish, ie., V’(#) — V’(+0) + V'(—0) 
— V’(—~) = 0. But V'(+ 0) = V’(—~) = 0. Hence V’(—0) = V’(0) 
so that the field is continuous across the membrane. Furthermore, if the 
potential is not continuous, V’ = ~ at x = 0. Evenif this is so, it can be 
shown that the boundary conditions can be satisfied and the infinite field 
is balanced by discontinuities in concentrations. Consider a finite mem- 
brane in which the potential V., is very small, but in which there is a 
work function ¢;3(x) associated with the Rt ion, $3 being continuous and 
finite; then both V and V’ are continuous. Now permitting the membrane 
to become infinitesimal leaves V continuous even though the work func- 
tion 43 becomes discontinuous. Examples will be given subsequently. 
~The concentration c; of any one of the ions is then given by the following 


expression: 


bz, —2z; 
ime tee) bee Peat 
Cio 1+ Ky ee : 

—b' n —22. 
Gen grote (ALBEN Me 20. 
Cio 1— Kae? 


(12) 
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In the case of the numerical example given by Bartlett and Kromhout 
(loc. cit.), Wao = —100 mv., [Cl] at * = —© = Go = 107 M/I 
= 10-? M/cc., D = 80, T = 300, so that c¢3o = 0.099955 M/l, cro 
=45 X10°M/l, Ki = 0.166, Kz = 0.664, 6 = 1.00 X 10’, and 0’ 
= 1.47 X 10°. Substituting these values into (8) and (12) leads to the same 
results as those given graphically by Bartlett and Kromhout (Joc. cit.) since 
they let the size L of the “cell” become much larger than 1/2’ = 68 A. 

Since only V., enters the final expressions we may replace a completely 
impermeable membrane with one in which there is some work function 
$3, which represents the work done by non-electrical forces in moving one 
R+ molecule to the left through the membrane. Then when x > 0, we 
subtract the quantity a¢3 from the exponent of that term in expression 
(3) which has the factor c3o. Thus ¢3 is measured for convenience in e.s.u. 
vols, even though ¢3 does not represent an electrical potential. If 63 = ~, 
we have the case just treated. If ¢3 is finite, we only change condition (5) 
and in its place write 


Cy = (Cy tte) e (13) 
Eliminating cio from (13) by using (4), we have, on solving for V., 
1 


Vo = on 


log [= 2a—« |: (14) 


The quantity ¢3 may be obtained from the expression: 


ee ace, C39 — Cot Co "= 
b= log, | “2 Cet]. (15) 


If R* is completely impermeable and is initially present on both sides 
of the membrane in unequal amounts, then the spatial distributions of 
ions will be the same as if R+ were a permeable ion, but of such a kind that 
the membrane did an amount of work on it equal to that done on it by an 
electrical potential ¢3. 

More generally, if we have m ions, the concentration of the 7th being 
ci, the charge being z;, and if the work function across the boundary from 
right to left is ¢;, then the solution is given by equations (8) to (12) with 
the following conditions replacing (4) and (5): 


>, fi Cig = 05, (16) 
i=1 


D t:cige tee = 0, (17) 


t=1 
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Note that if the equilibrium ratios of the concentrations are defined by 


Ci 
am 2, as) 
then 
R; ad; : 
jae ti) for PS rare O) 
j (19) 
RR;= fears for 2+ 2;=0. 
If V., is known, then ¢; is given in terms of R; by 
RT 
bi = Fp log.R: — 2;Vo. (20) 


We next consider two cases in which ¢; varies continuously with x, ¢3 
being essentially constant except within a given region. We do this in 
order to estimate the influence of the size of this region upon the space 
distribution of the potential. Let ¢: = ¢2. = 0. First let the region be 
characterized by a space constant A in such a manner that 


$3 (x) = 7h30e7, «<0 } 
3 (x) =d30 — FG3n€ 7, ta Ue 


(21) 


In this case the derivative of ¢3 is continuous and symmetric about the 
origin. Let a¢3. K 1 and aV < 1. If we let 


a?=2ak, ss (22) 
20 


then the expression corresponding to (3) may be written 


v"=aV+Ber, SUM (23) 


V" =a? V —Be+26, x>0.J 


Since V and V’ are continuous and V(— ~~) = 0, we find, except for 
av = l, 


B e* BY? er/> < 
Virrrea aay ake cae 
EELS ESAs e 
a?(1 — a*)?) a (24) 
pe BN eo 2 AB #207 


A comparison of expression (24) with the corresponding expression in 
which } = © will show how the finite size of the region effects the space 
distribution of the potential as well as that of the ion concentrations for 


this case. *S af ; 
In a similar manner we can treat the case in which @; is a linear function 
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within the membrane of thickness 2A, being zero to the left and a constant 
$3.. to the right of the membrane. The solution for the potential V is then 
given by 


B ( er 4 — e aA) ear 


— t< 0, 
iy 2a%A i 
e724 ( ear — e— 2) B Bu is ae 
eae © ct Re a age = A 2S 
V 2a%A az aA’ Ate j 20 
ef, RIMES AD las FRSA Vine 


~ 203A a? 


This expression may be compared with expression (24), noting that 
A ~ }, or with the expression resulting from letting A = 0. Note that 


TABLE I 
R; ab; ¢: in ‘‘mv.” 
Vio 
PREPARATION : 
In my. 
Ke Nat cis Ke Nat GE Kt Nat ce 

Loligo Axon...... 61 35° 1 0.45.) O13 4" 222 122.3 3°| 30 |—110 8 
Sepia Axon....... 62 52> ft Ostaole 5. See Pe By a ee a ee 20) == 11a sl ee 
Frog Sartorius 88 46 | 0.17 | 0.032 4 |—5.2 0; 10 |—134 0 

(Ringer)... ...- 
Frog Sartorius 

(Natural) ce. 95 AG | ees y 0.017 oy. emer —.4 ee er —10 


expressions (24) and (25) become identical if A = \ = 0, the potential - 
remaining continuous. 

If we assume that the ions in a nerve are in equilibrium with electrical 
and other forces (cf. Franck and Mayer, 1947; Hodgkin, 1951) we may 
calculate the quantities ¢; from (20). As an illustration, a few values are 
given in Table I in which the data are taken from Hodgkin (loc. cit). 
Listed in the table are a) the membrane potential, V; 6) the concentra- 
tion ratios R; for K+, Nat, and Cl-; c) the work assumed to be done by 
the membrane given both as ag;, the ratio of the work done to RT ; and 
d) as $j, the equivalent electrical potential on the ion in which each ion is 
treated as having a single positive charge. A positive value for ¢; cor- 
responds to a force directed toward the inside. From the results in the 
table and from the above model, it is clear that there is no appreciable 
active force in the Cl- ions but a very large outward force on the Nat ions. 
The K+ ions would appear to have a small inward force acting upon them 
in some cases. 
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A mechanism is described which accounts for the active transport of Nat ions through a 
membrane. It is assumed that at one side of the membrane the ion combines with a carrier ion, 
the resulting carrier compound then diffuses through the membrane and decomposes at the 
other side of the membrane. The free diffusion of the ions is also taken into account. The time 
rate of accumulation of the ion in question at the latter side of the membrane is calculated in 
terms of the concentrations of the ion at both sides of the membrane. 


Recently A. L. Hodgkin, A. F. Huxley, and B. Katz (1949) proposed a 
carrier model to account for the changes in the permeability of the nerve 
fiber membrane to Nat ions. H. B. Steinbach (1951) describes a specific 
Nat exchange system to account for the outward movement of Nat ions. 
In this model it is supposed that a specific sodium complex NaX is suf- 
ficiently diffusible in the membrane to exchange sodium with the internal 
and external phases. The symbol X~ represents an unknown anion, per- 
haps a polyphosphate, formed by the metabolic reactions of the cell. We 
will study such a picture here in more detail. 

We assume that at sides J and JJ in regions of thickness 6 (Fig. 1) Nat 
ions react with the carrier ion X~ under formation of the complex molecule 
NaX according to the reaction: 


k 
Nat +X = NaX . (1) 
=I 
Let the equilibrium constant K be defined by: 
Kop otNaty EX 
Due to different conditions at the sides 7 and //, K will in general have 


different values at J and J/. , 
The rate of accumulation of the compound NaX at the side J in a re- 
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gion with cross-section A, 


d(NaX] 
Soe (CT 
is given by the difference between the net rate of production of [NaX] 
according to the reaction (1), 46f1,[Na*]:[X-]; — A6k4,[NaX]7, and the 
diffusion flux 
[NaX],— [NaX] 1 


A Dyax d 


The diffusion constant of the compound NaX in the membrane is Dyax 
and d is the effective distance across which the diffusion takes place in the 


i Ir IN c]_ 


[Na x] - 


d 4 


r) 
O LEVEL OF CONCENTRATION 


FIGURE 1 


membrane. In the remaining part of this paper we will assume d> 6. 
Therefore we have: 


Ad CINOX I]t 4 5s, 1N Nat],;[X—],;— 46k-1,[NaX], 
= A Doig NOEL eS [Ne 
For the total balance of the X- ion we have: 
et Fine [Nax];= Xo, (4). 


in which we assume Xo, to be determined by metabolic reactions at the 
side J of the membrane. 
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Similar to equation (3) we have for the free Nat ion: 


ja tNat ls 


pee 


A 5k_-1, [NaX];— A 5ki, [Nat] 7(X7], 


[Na];— [Na] x 
d 


(3) 
— ADy, 


.in which Dy, is the diffusion constant of the Na+ ion in the membrane. 
Analagous to equations (3), (4), and (5), we have at side JJ of the 
membrane 


d(NaX 
A petNet lr bhi, [Nat] (XW) y= Abby (NaX ; 
rs (6) 
a Tie [NaX], 2 SUES 
[X~] a+ [NaX] n= Xo,,, (7) 
d{Nat 
Aé aol A6k_1,,(NaX] 1 — Adki,, [Nat] w(X7) a 
8 
fied Das 2) SRE san (8) 
We will assume that, due to different conditions at J and JJ, 
ote a eand ehg, on (9) 


Then we may say that the NaX molecule is formed at J and decomposed 
at IJ. A similar result would follow from the assumption that Xo, is large 
due to metabolic synthesis of the carrier ion X~ at J and Xo,, small due to 
metabolic breakdown of the carrier ion X~ at JJ. 

Furthermore we will suppose that the [NaX]; and the [NaX]z are in a 
steady state 


d(Nax], 

are ee 
d{NaX)iz_ 
ar ee : ay sy 


From (3) and (10) we obtain het 


Bh, [Nat] 1X7}, — (88a, + 3) (NaX] + 


ONE (NaX] 1 = 0. (12) 


From (4) we have: 
p Hy) [NaX];= Xo, — eee. ee Nee 
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and, similarly, from (7): 
[NaX] 1 = Xo,,— [X7],,- (14) 


Substitution of (13) and (14) into (12) yields: 


(Sky, [Nat] + 68a, + 2B) XV — SF 


(15) 


Similarly, we obtain from (6) and (11) and substitution of (13) and (14) in 
the result 


D 
= SA +(4 bi,, (Na) F Skog a) [X-] 


(16) 
a Dysx 
d 


Dx 
Xo, ts é Ray + od) Xo, = 


Assuming that even k1, < hki,[Nat]; [cf. (9)], whereas the terms 6f1,[Na*]z 
and 6k_1,, are of the same order of magnitude as Dy,x/d, we may neglect 
all terms in k_;, and ,,, since [Nat] < 1 in physiological circumstances. 
This amounts to neglecting the breakdown of NaX at J and the synthesis 
of NaX at JJ, in line with what we have said above. The equations (15) 
and (16) then simplify to: 


Dya D D D 
(oh, [Nat] + 28) [xo] = PB] = FBX PEE, (17) 


and 


= PBK] (Bhat PM) KL r= — PRE XS, 
(18) 


D 
+ (bby) + 78) Koy: 


Solution of the equations (17) and (18) for [X-]; and [X~—]y leads to: 
2 = bRo1yXo, 
[IX-];= ar WEMe Se o6 Ty pos (19) 

Shy, (88-1, 3) [Nat] + 5h, 


and 


D 
[X—] "9 ioe [ 5&1.) ‘Fee nes (Xo, — Xo,,) te Bhs Xoy7} [Na*] I Q ) 
0 


"0 
+ gDroxk—1yXoy | / bby( bb-ayg+ 3) [Nat] +28 Shay. 
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From (7) and (20) we obtain: 


Dyax 
J Ok1,Xo, [Nat] x 
[NaX],,= (21) 
D ad D aX : 
tes Chee 7 )bky (Nat): 


Substituting (21) into (8) and again assuming that, according to (9), the 
term in k;,, may be neglected, we. have: 


Dyax 
d (Na*] x d 6ks,Xo, [Nav] 1 
ond = nD D 
FOR ay + (8h + —F*) Bhs, [Nat] 
(22) 
pe [Na*]7— [Nat] a 
d 
Let us suppose here that 
18, a D a. ; 
= Sky, >( 08-1,,+ obs, [Nat] z, (23) 
which on the basis of our former assumptions amounts to: 
bap (Nat )s - (24) 


We may then, according to (23), simplify equation (22) to: 


d [Nat Dya 
: fi Itt = bh Xo, [Nat] ,+ 7 


{ [Nan [Na*] zr} : (25) 


The last term in the right-hand member results from the transport of the 
Na+ ions through the membrane by free diffusion. The first term describes 
the accumulation at side ZZ of the membrane due to the active transport 
of Nat ions through the membrane by the carrier mechanism. As far as 
our approximations hold, we see that the result of the latter is the same as 
that of a constant force acting on the Nat ions against a resistance force 
which, e.g., as in Stokes’ law, is proportional to the velocity of the Nat 
ions, resulting in the steady state in a constant velocity f of the Na* ions. 
The flux of the Na+ ions to the right would then be equal to f(Nat), and, 
by comparison with the first term in (25), we see, according to the mecha- 


nism described above, that: 
f = 6h1,Xo, - (26) 
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The recent extension of the approximation method is applied to enable us to arrive at the 
time course of the concentrations at both sides of a membrane. From the differential equations 
which govern these, the steady-state solution is obtained in terms of the parameters, which 
are determined by the thickness of the diffusion layers, the chemical composition and re- 
actions, and the diffusion constant of the membrane. 


Recently the approximation method of N. Rashevsky (1948) has been 
extended (Landahl, 1953) so that by this method not only the dependence 
of a variable on a space coordinate but also on the time coordinate can be 
treated. This method will be applied here to study the formation of a 
difference in concentration of an ion across a membrane in which a con- 
stant force K acts in one direction on the ion (Fig. 1). Due to the re- 
sistance of the ion in the membrane to the motion, taken to be linear as in 
Stoke’s law, the velocity of the ion inside the membrane will be propor- 
tional to the force K acting on the ion. Calling that constant velocity f and 
assuming, to fix ideas, that the force and therefore the velocity is directed 
to the right, the flux across any plane perpendicular to that velocity will 
be fc, c being the concentration just at the left side of that plane. On the 
basis of a carrier mechanism, G. Karreman (1953) has obtained a result 
similar to that of this formal picture. This model will be used in this 
paper. Let us assume that initially the concentration of the ion is equal to 
z at both sides of the membrane which is assumed to be of infinitesimal 
thickness (Fig. 2). After the mechanism under consideration has been 
active for a time ¢, the concentration at one side will be ¢ — a1 and at the 
other é + co (Fig. 2); it will be equal to ¢ at one side a distance 1 away, 
whereas it is also equal to é at the other side a distance 7. away (Fig. 2). 
At any moment the concentration is supposed to be a linear function of the 
distance from the membrane to the left and similarly to the right of the 
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membrane. During the formation of the difference in concentration the 
quantities c1, ¢2, 71, and 72 are all functions of the time #. 

At any moment the flux at the left of the membrane must equal the 
flux inside the membrane as well as the flux at the right of the membrane. 


MEMBRANE 


FicureE 1 


FIGURE 2 


If / is the permeability of the membrane and if D, and D, are the diffusion 
coefficients on the left and right sides of the membrane, this continuity of 
the flux F leads to 


P= Di t= — hort or) +h (2-4) = De. (1) 


Furthermore the total change in the amount of solute in the region to the 
left of the membrane is given on the one hand by the time integral of the 
flux and on the other by the space integral of the change in concentration 
on either side of the membrane. Therefore we may write 


f Fat=s na=t ne (2) 
, D) Bet 2 Q= : 
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For reasons of mathematical simplicity we will assume in the remainder of 
this paper that 

D,=D.,=D. (3) 
From the second and the last member of equation (1) we obtain, using (3), 


Gy G3 


aa (4) 


ipa 7:3 
From equations (4) and (2) we find 
CL— 6 Cb) @) 
Substituting (5) into (4) we obtain 
ccmeies =r (i). (6) 
Differentiating (2) and using (5) and (6) we find 


pe ee Oe 
ES 5 To om gee (7) 
Equations (1) and (7), together with (5) and (6), provide two differential 
equations from which the transient time course of r and of ¢ can be deter- 
mined. If we assume the membrane to be imbedded in an infinite medium, 
then r can increase indefinitely. Substituting (5) and (6) into (1) we have 


F=D<=f(é—c) —2he. (8) 
If we define ¢(c) by 
o(c) =f(té—c) —2he (9) 
we obtain from (8) and (9) an expression for r: 
De 
= 10 
Corrie at 
Differentiating (10) we find 
.- n 6¢(c) — co(c). (11) 
Espa bP GE) 


Substituting (11) into (7) we obtain, together with (10) and (8), 


1, dole) cle) 41, De 12 
2 el Bi @? (c) +5 © (oc) $ (Cc) ’ ( ) 


or, after some simple rearrangements, 
D(2fé) cdc D(f+2h) ede _oqy (13) 


[fe— (ft2h)e}® {fe—(f+2h) cj 
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Integration of (13) gives us: 


C 
2 ——1 
D f+2h ¢ 6224 =2t. (14) 
GF vay? (lee (1 - j )+ (aatseerhy 
f+2h ¢ 
Introducing 
faz, (15) 
é 
we have 
1 
Qhe pn? f+2h 5 Dhak (2 
CRED p= t0g(1- 25 + si rel eg 6} 
D mentee 
f+2hA 2 
From (16) we see that for t = ~, corresponding to the steady state, 
see Ye} 
B= Sa = FoR? (17) 


a result which could have been obtained immediately from (7) with 
7+ = 0 and ¢ = 0 together with the expression in the third member of 
(1) for F. In this respect it might be remarked that in this case F = 0. 
This is not the case, in general, in a steady state. But if, as in this model as 
stated above, the thickness of the layers at either side of the membrane is 
infinite, then in the steady state r will also be infinite and for a finite 
steady-state value of c the gradient and, therefore, F will be zero. 
Introduction of (17) into (16) leads to the equation 
1 
(18) 


Oig e 
(=-1) 


The half-life ZT is obtained by substituting z/z,, = 3 into (18). The 
result is 


2 
T= 0.81 2. } (19) 
For a ratio 

ZF <2 50 (20) 

we easily find 
Zam = 0.96 (21) 

and, therefore, 
Ta O,Ph (22) 
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This formation of the difference in concentrations at the sides of the mem- 


brane will therefore be slow compared with the diffusion in the layers next 
to the membrane, if 


he EH LD 
O4 = —_ 
14 >>> ap ° (23) 
This inequality holds for 
hee 14. Cn. Sec (24) 


when D = 10 cm.” sec. and ZL = 10-4 cm. Note that f has the dimen- 
sions of a permeability so that it may be compared with the passive per- 
meability, 10~* cm. sec.—' for potassium, for example. 

From (18) we have for the steady state: 


c=—-~— ¢. (25) 


Therefore we have for the ratio of the concentrations at the two sides of 
the membrane 


er pele (26) 


Substituting the expression for f obtained by Karreman (1953) and intro- 
ducing D,, = hd, we obtain 


(27) 


by which the ratio in the left-hand member of (27) is given in terms of the 
physicochemical parameters occurring in the right-hand member (cf. 
Karreman, 1953). 

This work was aided in part by a grant from the Dr. Wallace C. and 
Clara A. Abbott Memorial Fund of the University of Chicago. 


LITERATURE 


Karreman, G. 1953. ‘‘Some Considerations on a Carrier Mechanism as a Model for Ion 
Accumulation.” Bull. Math. Biophysics, 15, 161-65. 2h 

Landahl, H. D. 1953. ‘‘An Approximation Method for the Solution of Diffusion and Related 
Problems.” Bull. Math. Biophysics, 15, 49-61. rae 

Rashevsky, N. 1948. Mathematical Biophysics. Revised Edition. Chicago: University of Chicago 
Press. 


BULLETIN OF 
MATHEMATICAL BIOPHYSICS 
VOLUME 15, 1953 


CONTRIBUTION TO THE MATHEMATICAL THEORY OF CON- 
TAGION AND SPREAD OF INFORMATION: I. SPREAD 
THROUGH A THOROUGHLY MIXED POPULATION 


H. G. LanpAu AND A. Rapoport 


COMMITTEE ON MATHEMATICAL BIOLOGY 
THE UNIVERSITY OF CHICAGO 


An equation is derived from the spread of a “‘state” by contact through a thoroughly mixed 
population, in which the probability of transmission depends both on the over-all duration of 
the process and on the time an individual has been in the ‘‘state.”” Cases in which this prob- 
ability is a function of only one or the other of the two “‘times” are worked out. It is shown 
that in the case of dependence on ‘‘private time” alone the asymptotic value of the fraction of 
the population affected is the same as that derived by the random net approach. 


1. Introduction. Consider a population whose members are thoroughly 
mixed, that is, they come in contact with each other periodically in such 
a way that the probability of contact between any two individuals is the 
same for any pair. We wish to study the spread of a state through such a 
population. A state is something which is transmitted by contact. An indi- 
vidual acquires the state by coming in contact with another who had 
already acquired it and has not yet lost it. 

In general, there may or may not be “recovery” (loss of state) and 
“immunity” (impossibility of acquiring the state more than once). The 
period over which an individual may transmit the state may be limited, 
and transmission during that period may occur with variable probability. 
All this terminology has obvious implications in the description of a com- 
municable disease epidemic. Moreover, the spread of information through 
a population, or of a behavior pattern (by imitation), can be described in a 
similar language. 

In this paper we shall be concerned with the case where there is no re- 
covery or, at least, the individuals who have once acquired the state con- 
tinue to be counted as having it, although, in some cases, the probability 
of transmitting the state to others may vary with the time that an individ- 
ual has been affected. Therefore the state can be acquired at most once by 


each individual. ee: peithe 
The general equation for the spread of a state which is derived in the 
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next section is similar to that given by W. O. Kermack and A. G. Mc- 
Kendrick (1927) for epidemics. Our equation, however, includes a more 
general dependence of the probability of transmission on time. In addi- 
tion, it is pointed out below that the interpretation of the threshold phe- 
nomenon given by these authors needs some modification. 

In recent years, stochastic methods have been applied to studying the 
growth of biological populations, and a start has been made toward 
studying epidemics by these methods (Bailey, 1950; Landau, 1952). The 
stochastic method aims at finding the probability that a certain number 
of the population will be affected. Here, however, we are concerned with 
the deterministic theory which deals from the start with the average 
number affected in a large population. There appears to be still a number 
of unsolved problems in this theory. 

2. Equation for the spread of a state. The most important function in the 
equation for the spread of a state is the probability that the state will be 
transmitted when a contact occurs between an affected and an unafiected 
individual. In the simplest case, this probability is constant throughout 
the process. The spread of the state then follows the well known logistic 
equation, where the fraction of individuals affected approaches unity. 
We shall consider the more general case where the probability of trans- 
mission p(t, tT) may depend both on the time ¢ of the whole process and 
also on 7, the time since the particular affected individual acquired the 
state. We shall call ¢ “clock time” and 7 “‘private time.” 

We introduce the further notation: 

N = number of individuals in the total population, assumed constant during 
the spread of the state, 

«(¢) = total number of individuals who have become affected up to the time ¢, 

%9 = number of individuals who became affected at ¢ = (, the start of the 
process, and 

a = frequency of contacts, assumed constant. More precisely ax(#)(V — «(t)) 
is the number of contacts in unit time between affected and unaffected in- 
dividuals. 


Now the increment to «(#), in the interval At, which results from con- 
tacts between unaffected individuals and those who became affected at 
time X will be 


Ax (t) =a(N —-x (t))Ax(d) p(t, t—d) AL. ac 


The increment to «(¢) resulting from contacts with those individuals who 
were affected at the very start will be 


a(N — x (t)) xop (t, t) At. (2) 
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To get the total increase in x(t), we replace Ax(X) in (1) by 


dx (i) 
Oe. 


Ad, 


sum over ) from 0 to #, and add the term from (2). This finally gives 


t dx (X) 
dy 


dx 
dp KN = 2%) [xop (t, t) + p(t, t—r) dQ], (3) 
0 
which is the desired equation. 
It is also useful to have the equation for the fraction of the population 
affected, 2(#) = x(t)/N. We have, then, in terms of this fraction 


d ‘d 
wr Ad— slap Go +f Fy P(t t—d) adl, (4) 


where A = aN, and 2 = x/N. 

The necessity for allowing p(é, 7) to depend on both kinds of time is 
easily seen. In the case of a spreading rumor, it means that the probability 
of transmission may depend on both the age of the rumor—say, its news 
value—and on the individual’s urge to transmit it, which may be deter- 
mined by when he heard it. In the case of a disease epidemic, the infec- 
tiousness of the pathogenic organism may decrease with time, perhaps 
because of mutations or of some other organism parasitic upon it. The 
dependence of p(t, 7) on 7 will be due to the characteristic course of the 
disease and may be analyzed (Kermack and McKendrick, Joc. cit.) as fol- 
lows. Let (7) be the rate of infectivity per contact at time 7 after infec- 
tion of those individuals who remain sick, and let (7) be the rate at which 
infected individuals are removed from the sick group by recovery or death. 
Then 

a Svan 


is the fraction of the sick whose illness has lasted a time 7 and 


ies Ce et 


If ¢ and y are both constant, this reduces to 
pli, 7) =pe", (6) 
which is one of the special cases considered below. 
3. The probability p(t, 7) is a function of t alone. Here the contagious- 


ness of each individual is independent of the time when he became af- 
fected (of his private time), but does depend on the total age of the proc- 
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ess. Equation (4) reduces to 

dts) (Amp) +400 f ds] = 4p) 21-2). (7) 
dt 0 
Separating the variables, we obtain 


dz 


els ha Se 8 
SST) Ap (t) dt, (8) 


whose integral is 


log pag= 4 f Pat + log 7, 
(9) 
Cth nina 
ease | 
where 
C=3 he (10) 


In the special case where p(/) = p, a constant, (9) reduces to the 
logistic equation 
C et 


== Tice 


(11) 
where a = Ap. Hence our equation (9) is a simple generalization of the 
logistic equation. We have immediately the following result concerning 
the asymptotic value of z, z(), the fraction of the population ultimately 
affected. 

Theorem 1. Let p(t) be integrable, that is, fi p(t) dt = b < ~, then 

0 

the asymptotic value of z is given by 


Ab 
#(@) = oon. (12) 


If p(t) ts not integrable, z(@) = 1. 

The constant 6 has the dimension of time and can be called the effective 
time of the contagion process. Theorem 1 can then be restated as follows: 

If the probability of contagion depends only on the age of the process, then 
the asymptotic value is a logistic function of the product of contact frequency 
and effective time of contagion. 

4. The probability, p(t, 7), equals e~*". Here the strength of contagion is a 
decaying exponential function of the ‘private time” alone and does not 
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depend on the age of the process. As noted above, this can be interpreted 
as due to a constant rate of infectivity while the individual remains in the 
infected group, and a constant rate of removal of individuals from this 
group. A constant as the coefficient of the exponential is omitted, since it 
may be considered to be absorbed in the A. We have, then, 


(om ead ee (13) 
: k 


as the average number of effective contacts per affected individual. Equa- 
tion (4) now becomes 


dz 


dz 
Zz —Ké EE kr 
ha = le Zo € +Ae ‘f Gee dr], (14) 


which may be written as 
 @ ar “Bil 
— & FT log(1— 2) = Am +A f Setdr. (15) 
Put log (1 — z) = y. Then z = 1 — e”, and dz/dt = —e%dy/di. We then 
obtain by differentiating (15) with respect to ¢ 
ytky=Aey, (16) 
where the dots indicate, as usual, time derivatives. The first integral of 
(16) can be obtained immediately: 
ytky=Ae+K, (17) 
where K is a constant of integration. We can evaluate K by noting that 
y(0) = log (1 — 20), and y(0) = —A2zp, as is evident from equation (15). 


Therefore ree ee ay (18) 


Equation (17) now becomes 
ytky=A(ev—1) +k log(1— %). (19) 
Resubstituting z for 1 — e” and separating the variables, we obtain 


a8 tie (20) 
(1 — 2) {log -—* +4: 


ees (20) can be solved for ¢ in terms of a quadrature: 


=f Bf =a (21) 
0 


Za s) [4 z+ k log z= a 


which implicitly defines z as a function of t. 
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We are interested in the value of z() as a function of the parameters 
zo, A, and k. As we shall show, 2(@) depends not on A and & separately, 
but only on A/k = a. Clearly z(~) is the smaller root of the denominator 
of the integrand in equation (21), and by the nature of the problem, this 
root is not greater than unity. It is therefore the root of the transcendental 
equation 


k log =a (22) 


= 
Taking exponentials, we find that the asymptotic value s* = z(~) must 
satisfy 


z*=1—(1— 2%) e-*. (23) 


If zo = 0, (23) reduces to the equation derived by R. Solomonoff and 
A. Rapoport (1951) and independently by H. G. Landau (1952) for the 
“weak connectivity” of a random net with axon density a. Equation (23) 
can be solved for a, namely, 

1— nA 
1— 


: — log (24) 


5. The quantity p is a function of r only. Equation (23) for the asymp- 
totic value of z holds generally when p(t, r) = (7). This is proven in 
Theorem 2 below. This result is also obtained by Kermack and McKen- 
drick (Joc. cit.). 

Theorem 2. If p(t, r) is a function of 7 alone, p(r), and if 


z*=1— (1—%) e-@". (26) 


Proof. For p = p(r), equation (4) is 


f 
Gn7Ad— alae +f Fee-war], (27) 
which may be written as 
d ques 
Ga A(l- a) J 2 PO)ar (28) 


or 


d 
— hog (1 = 2) =45 [stn pvan. 


eae. ge iy a Et 


ERRATUM 


To the paper by H. G. Landau and A. Rapoport entitled “Contribution 
to the Mathematical Theory of Contagion and Spread of Information: I. 
Spread through a Thoroughly Mixed Population” (Vol. 15, pp. 173-83). 

In the above paper, equation (29) on page 179 should read 


1—s=(1- s)exp[—4 f 2—» paar]. (29) 


~ a a a, . 
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On integration, we have from (28) 
a |-4 fz» poral]. (29) 
fe 4 


Now it can be seen that the assumptions 0 < z) <1 and p(r) > 0 
result in 2(¢) having a positive derivative and 2(f) < 1. Hence 2(t) has a 
limit z* as t + ©. Then, writing z(/) = z* + 0(t), we have 


Af 2-»PO)drn= det [part fou—r» pan 
0 0 


= 4st [pO at4 f'9t—n pata f'OG-v pan. 
i) 0 ty 


Fori > ~, f 9 — \)p(A)dX has the limit zero, because for ¢, bounded, 
0 


t—X-— © ast — o and hence A(t — A) > 0. On the other hand, from 
(25) | 
J 8(t—d) P(A) dr 
ty 


can be made as small as desired by choosing ft; sufficiently large. 
Equation (26) then follows. 
The following series expansion for z* in terms of a is readily obtained 
from that for 29 = 0 (Landau, loc. cit.): 


eT A= 5.) 3 ee (30) 

Now it is easily seen from (26) that if 29 = 0, then fora < 1, 2* cannot 
have a positive value. Since in actual epidemics the number of original 
cases is usually a very small fraction of the total population, Kermack and 
McKendrick (Joc. cit.) concluded that the value z) = 0 approximates to 
reality and that, therefore, there is a threshold for epidemics at a = 1; 
that is, no epidemic for a < 1 will occur, and an epidemic will occur for 
a > 1. We can express this in terms of the population size by writing 


if “p(a)dd = 1/k and using A = aN, so that the condition becomes 
0 
N ghy (31) 
a 


As can be seen from the figures, there is, strictly speaking, no threshold 
for small} zo at a = 1. There is, however, a rather sharp change in 2* near 


+ The value 2» = 0 is, of course, only a limit, which never actually occurs, because for 39 = 


_ 0 the only solution of (27) is 2(¢) = 0 for all f, that is, no infection at all. 
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a = 1. We can calculate z* more exactly by writing (26) in the form (24), 
then expanding for small zp and 2*: 


amit+der—. (32) 


This gives for 2*: 


z*=a—14+ V(a—1)?+2%, C33) 


' 2 3 4 5 


Ficure 1. The asymptotic value z* as a function of a for a few values of zo: 
s* = 1—(.—in)e=*. 


from which we get the approximations 


cg 8 

A pana fora<1, 

st~VJ/22z, fora=1, (34) 
Zo 

a*c2(a— 1) +——> fora>1. 


It would be worthwhile to investigate epidemic spread for small zo by 
stochastic methods which would give a more exact statement of the prob- 
ability of a large epidemic resulting from a small number of original cases. 

6. In a finite interval p(r) is constant. Here the contagiousness of each 
individual is constant throughout a finite period and then suddenly ceases. 
Without loss of generality we can set 


p(r) =1 for! $6, p(r) =0 fort>s. (35) 
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Equation (4) then becomes 


dz 
G42 — 2) for rS6, 
(36) 
ag 
wie a) 9). NOE SO. 


This is the equation derived by A. Shimbel (1950) and is a special case of 
the equation given by E. B. Wilson and M. H. Burke (1942) and of a 


z* 
te 


.08 


-06 


.04 Z,= .02 


02 


Ficure 2. The asymptotic value z* for some small values of Zo 
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similar equation of M. Puma (1939). The time-dependent solution can be 
given by the recursion formula 


Pitt 4 (1 — ay) Laat — OI, (37) 
where 
ef 2 aS 
Sr Cee aie: 


is the solution of (36) in the interval 0 < ¢ < 6. The functions z,(#) are the 
corresponding solutions in the intervals (n — 1)6 S$ ¢ < né. 

The solution of (37) can be given as follows. Let s = At. Define func- 
tions F,,(s) and w,(s) as follows: 


F,(s) =f (1— a (s)1ds 


(38) 
e — F,(s) : 
a ae i= Zn+1(S) 
Then 
dwn +1 Sn41 (5) 
Saae Reame tl see tS 21 9 —1 n ee 
ds wen (s) | tal) +7] (39) 
=Wnrti(s) [—1+ ees: + Zn41(s) Ss eS.) = — eo rats) 
Thus 
= 1 — F,(o) 
NG ten een -ax my eyed Oh da, 
giving 
( ) 1 e7 Fnts) 
Stills) = = = : (40) 
1 2,(248) ] 8 —F,) 
[1—2,("A 6) ] f de 
This gives for z2 
—A(t—S) 
silt) ‘xsi iat aoe a a 
(C+1) (14+Ce4*) — AC (¢— 8) —14 e— 40H)? (41) 


for 5<t<26. 


The calculation of z, by (40) is straightforward but soon becomes very 
complicated. The asymptotic value is again given by (26). 

7. A lag in p(r). If there is a lag in the transmission of contagion, we 
take p(r) = 0 for r < 4, a constant. In this case, we must have z = 2 
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for ¢ < 6. We can write equation (29) in the form 
t 
z=1—(1—s)exp| — 4 fos(t—2) pa) da], (42) 
Putting u» = t — d this becomes 
t—é 
z2=1- (1—2,)exp| — A -- 3 
0) exp| Jf 2) p= du]. (43) 


Now it can be seen that the right side does not involve 2(), so that this 
equation allows us to calculate z in successive time intervals of length 6. 


Put 
a(t) = 2 (i), for n6<t< (n+1)6, 
then 


ai() =1—(1—sy)exp[— 4 fo mp (Un) du], 


iret (1 sy)exp }—4[ fp —n) du 


+ fo 'aWp-u)anlt, 


and, generally, 


q(t) =1— (1-29) exp \-4[ fr (=n) dut fo x(n) p= wd 


t (44) 
a: athe yt wb (t= nw) du], fornia 


This work was aided by a grant from the Dr. Wallace C. and Clara A. 
Abbott Memorial Fund of the University of Chicago. 
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CONTRIBUTIONS TO THE MATHEMATICAL BIOPHYSICS 
OF THE CARDIOVASCULAR SYSTEM 
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The reflection of pressure waves in a fluid enclosed within a tube with an elastic wall is 
studied for the case of a localized change in diameter of the tube. The concept of impedance is 
introduced. The relation of the reflection characteristics of the parts of the tube at either side 
of the change is derived on the basis of the continuity of pressure and mass flow at the site of 
the change. This relation is used to derive the expression for the ratio of the pressure oscilla- 
tions measured in front of, and behind, the constriction in terms of the constants of the system. 
As a result, a method is indicated to locate the coarctation from measurements of the pressures 
in front of, and behind it. 


In recent studies of the pulse pressure, the treatment of the reflection of 
pressure waves in the circulatory system has been emphasized. According 
to I. G. Porjé (1946) the lack of a theory of such reflections is the main 
obstacle in the study of the pulse wave. A similar impression was ex- 
pressed by A. Apéria (1940). In particular, the study of the standing wave 
pattern in the aorta and large arteries by W. F. Hamilton and P. Dow 
(1938) strongly suggests that the pressure waves are reflected in the cir- 
culatory system. This phenomenon of the reflection of pressure waves in a 
fluid enclosed within an elastic tube was studied theoretically by G. Karre- 
man (1952). In this study the case of a localized change in geometrical or 
elastic characteristics of the tube or its wall was investigated. It was as- 
sumed that the parts of the tube at both sides of the change were infinitely 
long, as shown in Figure 1. Furthermore it was supposed that in one part 
there was the incident wave and its reflected wave. In the other part, how- 
ever, there was the transmitted wave and no reflected wave. For this 
simple model the reflection coefficient was expressed in terms of the ratio 
of the radii, of the thicknesses of the wall, and of the elasticity moduli of 
the wall. The present study is an extension of this latter work. 

As is known from the theory of sound, the propagation of pressure 
waves in tubes can be studied conveniently after the concept of the specific 
acoustic impedance has been introduced. By this quantity we mean the 
ratio between the pressure and the particle velocity. We shall proceed 
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similarly here, but shall treat only the stationary state. F urthermore, we 
shall consider the case in which there are only plane waves, as would be the 
case if the wave length were much longer than the radius of the tube. Then 


we have: 
p= Acie) +B eik(rtet) (16) 


in which p represents the excess pressure corresponding to the wave, k is 
the wave number, « the coordinate in the direction of the axis of the tube, 
c the velocity of propagation of the waves, / the time, and A and B two 
amplitudes. As was shown in the work mentioned above (Karreman, /oc. 
cit.), the corresponding particle velocity v is then given by: 


v= Ae 1 A etk(@-¢t) — B e ik (et et) ; (2) 
pc 


in which p represents the density of the fluid and all other symbols have 
the meaning defined above. For undamped waves £ is real. If the damping 
(e.g., due to viscosity) is taken into account, & is complex with a positive 
imaginary part (Morse, 1936, p. 214). 

As is known from the theory of sound (Morse, Joc. cit.), it is convenient 
to write the reflection coefficient B/A in the form of an exponential: 


B 


— a ew 


A ? (3) 


in which y will in general be complex; it describes not only the ratio of the 
amplitudes of the reflected and incident waves, but also their phase dif- 
ference. 

The specific wave impedance ¢ is defined by: . 


p 


2=5 
vy’ 


(4) 
analogous to the case of an electric or acoustic impedance. 
From equations (1), (2), (3), and (4) we obtain (cf. Morse, Joc. cit., 
p. 191): 
2=pctanh(y+7kex). (5) 


Formula (5) shows that the impedance z is a function of x. 
We will apply equation (5) to derive the relation between the reflection 
coefficient, —e~™, in the tube AB in shen 1 and that, —e~™, in the 
tube CD in the same figure. 
Since the velocity of propagation C of the pressure waves in a fluid 
enclosed within an elastic tube depends on the radius of the tube (cf. 
Karreman, Joc. cit.), it will be different in the tubes 4B and CD, Let us 
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denote it by C, and C; respectively. Then we have for the impedance of 


the tube AB, according to (5), 


2 (%1) = pc; tanh (Wi; + 7k x), (6) 
and, similarly, for the tube CD, 
Z(%2) = pce tanh (Wo+ikoxs). (7) 
| 
| 
| 
2R 2A 
= 3 te} 
aed Sa 
Roa= lp apes sala ees 
Ficure 1 


From the equation of continuity of pressure at the site of change of the 


tube we have: 
Pea=?bc (8) 


in which pg and fc represent the pressure at B and C respectively. From 
the continuity of mass flow at the same place we obtain: 


rR? vp = TP VC (9) 


in which vg and 2¢ represent the velocity at B and C respectively. Division 
of equations (8) and (9), using (4), yields 
| Fk 
#-48 an 
From (6) we find for # = Li: 


Zp = pc tanh (¥itikili); (11) 


and from (7) for x: = 0: 
Zo = pcz tanh yr. (12) 


Introducing (11) and (12) in (10) we obtain: 


pes tanh (YitibiLs) _ pce tanh Ya (13) 
R? r? 
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Simplifying and rearranging (13) we have: 
1— e2Witihi Ls) R%cp 


= : 14 
Efees@t Wied @ fice tanh Y2 (14) 


Solution of equation (14) for e~”¥' in terms of e~™ yields: 
R2¢, 1 — e 2 


aS = 
e741 —_ ror 1 ae oes e2tk, Ly Fy Cis) 
1 R2¢. 1 — e7%2 
ies Fe ci 14+ es 


Equation (15) gives the desired relation between the reflection coefficients 
—e—-™ in AB and —e~™ in CD. For the particular case in which there is 


| 
A : 
2R Qn 2R 
| Cc D ] 
| | 
1 
A B E G F 


K— L, — >--L, ~kK-— LL, -— > 
K-L—>| 


FIGURE 2 


no reflected wave in CD, we have e™ = 0. Then, if we take the origin 0, 
at the site of the change (15) reduces to: 


R? 6g — rc, 
eo ty alin Meme EA 
e ii ee (16) 
Equation (16) is the result of the former work (Karreman, loc. cit.) where 
this particular case was studied. 


From (13) we easily derive: 
tanh (—7k,L,) +a tanh ye 


eenn¥i Ten aphd ako tie su 
in which R 
Re 
i, rc," es 


We will now apply this result, (17), to the case illustrated in Figure 2, 
as a model of coarctation. 
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Calling the values of Yin AB, CD, and EF respectively, yi, Wo, and yw; 
we have equation (17) as a relation between tanh Y, and tanh y». A similar 
equation holds between tanh y2 and tanh y;: 


tanh ( —7tk2Le) ++ tanh 3 


tanh p. = i 5 (19) 
1 5 tanh ( —ik2L2) tanh 3 
Substitution of (19) into (17) yields, after rearrangement, 
tanh yi = } — tanh (iL) — a tanh (ikels) 
a & tanh (iL) tanh (ikeL2) +1] tanh yo} 
(20) 


1 
/} io Unttanhie iE ianhilsh Le) [- ah 


Epa bsha) |tanh vst 


From (5) we have, with c = c3, W = W3, x = Lz, and z = gr, 
Ze = pc3 tanh (W3+7k3L3) . (21) 
From (21) we derive, similarly as before, 


Re ries os Bees 
pc3 (22) 


tanh W3 = Z . 
1— =<. tanh (1k3L3) 
pcs 


Substitution of (22) into (20) and rearrangement yields: 
nies [ 20 Vienh Gh.) a tanh hale) } 1 t= ar tat ess) 
A z 
a {tanh Cel ah Gaia) 1 t} Se anhtiesta yee a1 


(23) 
Hh [ (1 +a tanh (ifiZ1) tanh (iheL2) | \1 = tanh (iksLs) } 


= = tanh (ikeL2) + tanh (thil1) t — tanh (iksLs) cared 


From (6) we have, for 1 = 0, 2(0) = Za, 

2, = peytanh y . (24) 
Equations (23) and (24) express the impedance 2, at A in terms of the 
geometric quantities Li, Le, Ls, R, and ¢, the velocities of wave propaga- 
tion ¢, ¢2, and c3, and the impedance 2y at F. 
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If the constriction is a total closure, 7 = 0, and we therefore have from 

(18) a = @ and obtain from (24), together with (23), 
Ag ail ln 25 
74 ~ tanh Ghali) Se 

independent of L; and zy as, of course, it should be. In addition, it is easy 
to show that (25) gives the impedance at the mouth of a tube of length 
L, which is closed at the other end. 

If the end of the tube at F is closed, zp = © and (24), together with 
(23), reduces to: 
Za => pCi { tanh (tkiL1) a a tanh (iR2L2) } tanh (1k3L3) 


+4* tanh (ihL:) tanh (éheL2) +1] 


(26) 
of _. {1.-} avtannh @RiL,) tah kiZ4))} { tath GEL) 


— {+ tanh (iteLs) + tanh (ibiLy) {. 


Forr = R, together with ZH, = E, = E; and 6; = 6 = 63, so thata = 1, 
(26) reduces to: 


bees pcr 
oA Hanke bi be Lae Cn 


which is the same expression as (25) with (Z; + LZ. + Zs) instead of Li, as 
should be the case, similar to the above. 

Formula (26) can be used to illustrate certain properties of the im- 
pedance z4. However, before doing so, we will use the fact that, if the 
influence of the viscosity is not neglected, the quantities k,(i = 1, 2, 3) 
occurring in (26) will have a positive imaginary part (cf. Morse, Joc. cit., 
p. 214). To simplify further treatment we will assume that L; is so large 


that we may approximate tanh(ik3Z3) by —1, its value for L3 being 
infinite, 


tanh (ik3sL3) = —1. (28) 
Introduction of (28) into (26) leads to: 


go peal — {tanh (ikiL;) +a tanh (ékeL2) } 


+2 tanh (7k,L;) tanh (ikoL2) + 1| 


(29) 
(a [1 +a tanh (iksZ;) tanh (iksLe) 


— {= tanh (itaLa) + tanh Ghul) {]. 
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This same formula (29) would be obtained from (23) if we had intro- 
duced (28), independent of the value of z;. The latter should, of course, be 
the case, since in an infinitely long uniform tube with damping no wave is 
reflected. From (4) we have at point A in Figure 2: 


pa= Zara. (30) 
Similarly, we have at the point G in that figure: 
Pe=Zere- (Silo) 


To find the relation between p, and fg we still need to determine zg and 
the velocities v4 and vg. The impedance 2¢ is given by: 


EL coer tanh(vs— +=" 1), (32) 


which follows from (5) if we take k = (2ry/cz) + i(d3/c3), x = L,c = 63, 
Y = v3, and z = zg. The quantity d; is a damping factor, due to the 
viscosity (cf. Morse, Joc. cit., p. 214). Introducing the expression of 
e *¥* obtained from (21) into (32) and, as before, taking the limits for 
Zp — ©, assuming the end at F to be closed, and ZL; sufficiently large, we 


find: 
Zq= pcs, (33) 


as should be the case because (33) is the expression for the impedance of 
an infinitely long tube. 
According to the formula (2) the velocity in the tube AB is given by 


VAB = Aze*s@i-%*) — By eiherteint : (34) 
pe 
that in the tube CD by 
Yep — Ag ets (2-2!) — Bo evihyestenn | : (35) 
p C2 


and, finally, that in EF by 


Uer= BLY 1 As ets (%s— a") — Bg ehestan| 3 (36) 
poz 


From (34) we obtain, for , = 0, 


1 , ; 
i —ik,c,t —ik,e 
a Sie ke tkyest — By e— #1) 


(37) 


ABSORP 
poy. 
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From (3) we have: 


=. == Se e711, (38) 


Introduction of (38) into (37) leads to: 


u4= iy Set allt Meni A (39) 
pei 


Similarly we find, from (36) and (3), 


’g= Aa (1 + e-2vs—2tksL) e—2xiyt | (40) 
p 3 
To find the relation between v4 and vg we have to find the relation between 
A, and A3. To establish that we proceed as follows. 


From (34) we have for «, = Zi: 
i= Wh Sf Ape.) Bethea} (41) 
z,=L, pcx 
Introduction of (38) into (41) yields: 
VB Lave “+e ,—2ik, L,) giky L,—2niyt | (42) 
pc, 
Analogous to (39) we have: 
Ag = 
de = 2 (1+ es) enone (43) 
p C2 
Introduction of (42) and (43) into (9) leads, after simplification and re- 


arrangement, to: 


R C2 1+ ey, 2ik, Ly 
AS nee ahaa ee (44) 


A similar relation holds between 43 and A»: 


12, 1 e—M. 2k Ly 
“Bea item et (45) 


Multiplication of equations (44) and (45) yields: 


os (1 -L e—™,—2ik, Ly) Ct -f e¥,—2tk, L,) : : 
Bes (1+ e-™) (1+ es) Mam eat ses 


Introduction of (29), (33), (39), and (40) into (30) and (31), and taking 
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the ratio of the equations so obtained, we find, using (46), 
— [ - {tanh (ikiLi) + tanh(ikoL.) } ++ tanh (7k,L) tanh (¢koL2) 
i 1| (1 e-%1) (1 es) (1+ es) et 2, +h, Ly +h, Ly) 
sab +a tanh (i#iZ;) tanh (ikoL2) — ie tanh (ikeLs) (47) 
+ tanh (iis) ¢] (1+ eh.) 


© (1 e722 22) (1 4+ e528, Ly) | 


For reasons of mathematical simplicity we will only consider some 
special cases. 
Solution of (21) gives: 


eed 

e 2s — SS erik, L, 4 (48) 
a =a 
ys 


Since we consider here only the case in which the end at F is closed, we 
have zr = © and from (48) we then obtain: 


e243 = — erik, L, , (49) 


Remembering that &; has a positive imaginary part we derive from (49), 
for the case L3 = © which we consider here, 


es =0, (50) 


as should be the case. Of course this holds irrespective of the impedance 
Zp at the end of the tube F. 


Similar to (15) we have: 
’ r2c,1— e7™s 
Wy = yp LR ee * cent, (51) 
. ‘ pegs eel 7 
R?o.1+te™s 


Introducing (50) into (51) gives: 


ens erik, Le | (52) 


To simplify the further treatment we will make the assumption that 
r°c3/R°cy <1. For values of r/R in the range 0.4 — 0.3, r2c3/R’c. varies 
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from 0.1 — 0.05, assuming c to vary only with R. Since it is likely that c2 
is still much higher due to changes in £ (and 6), our assumption 
7°c3/Rcy <1 seems to be justified. From (52) we obtain with that as- 
sumption: 
€ i es (aoa 


Substitution of (53) into (15) yields: 
R26. 1 — e2*Fehs 
a "98 C, 1+ eri L, 
eS, Bice a (94) 
ett 72 cy 1+ e?the ts 


With plausible values for k, and Lz we find |ikeL2| « 1. We may approxi- 


mate (54) then by: 
i - atkoL»e é 


2ik, L, 
1 ae atkoL» : see 


eh; = 


For a value of r/R of the order of 0.4 we have that even |iak2:L2| « 1, and 
we may approximate (55) by 
en, == pth, By. (56) 
Substitution of (50), (53), and (56) into (47), using the fact that 
k;L; K1 (j = 1, 2), so that we may approximate tanh(7k,;L;) by 7k,L; 
(j = 1, 2), and expansion of the obtained expression for z4/z¢, we obtain, 
keeping only terms up to the order of kikoLiLe, 


ae 1 1—(a-=)ikelet2 (a—=)érkeLatat cos (AL) cos(ReL2) e~*s”. (57) 


From (57) we obtain: 


|pal _ gh 2 1 ; 
oa =Vf142(« = hikalaLa|? + | —(a-=)iutl ee 
X cos (RyL1) cos (RoLs) . 
Since |k;LZ;| «1 (j = 1, 2) we may approximate (58) by 
| 2 1 oak 


From (57) we also obtain: 
i anes 
arg $4 ~ -(a-=) Gee | 2 (60) 


In (59) and (60) Z1, (a — 1/a)keZ, and L appear as unknowns, since 
k; = Iry/c; (j = 1, 3) may be assumed to be known. Since the total 
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length Z; + L, + L is also known (approximately Z; + L), we can ex- 
press L in terms of Z; and then eliminate Z from (60). Solution of (59) and 
(60) for (a — 1/a)k2L2 and LZ, will then give us the values of these un- 
knowns. Therefore we see that in principle Z; can be obtained from pres- 
sure measurements made at both sides of the coarctation. 

At the same time (a —1/a)kaZ2, which is a measure of the deviation from 
normal, is obtained, combining in itself the degree and the extent of the 
deviation. Of course it will be clear from the above treatment that this 
method will, in general, not give an accurate determination of the location 
of the coarctation, but it indicates another new method. Before using the 
relation obtained, it should first be checked with experimental data. Since 
k; = 2mry/c; we can bring (59) and (60) into the forms: 

| ba Be i= 1 167° (« ce Laks 47? (« --) (2) be, (61) 
a Ce 


l pe? ya oe 


Da. 5 Sot I» ig 
arg f4= —}20 (a = A+ tate (62) 


Since for a coarctation a >1 we would have, according to (61), an increase 
of | p4|2/|p¢|? with y. This might explain the fact, found by Porjé (oc. 
cit.) that behind the coarctation the fundamental oscillation is relatively 
stronger than the first and second partial oscillations. Though the 
trend in this aspect seems to be in the right direction, there are unfortu- 
nately not enough data to check relation (61) which gives a proportional- 
ity between | 4|?/|/¢|? — 1 and y?. More data, also obtained from model 
experiments, along these lines would be very useful. The latter also holds 
true for the linear relation (62) between arg p4/P¢ and y. 

This investigation was supported by a research grant H-627(c) from 
The National Heart Institute, of the National Institutes of Health, Public 
Health Service. 
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A mathematical model for the development of human society, beginning with the earliest 
stages of urban cultures, is outlined. In the early stages of history, behavior was characterized 
largely by adherence to a number of beliefs and prejudices of different kinds, which were 
accepted on faith and not subject to critical rational analysis. Due to psychobiological 
variability a very small number of individuals spontaneously appear at all times who challenge 
the accepted beliefs and prejudices and do not follow the accepted patterns of social behavior. 
The effect of these individuals upon the rest of the society, especially upon the younger genera- 
tion, depends on the facilities with which information spreads in society. In earliest societies, 
when modern methods of mass communication were unknown, the channels of communication 
were practically identical with the channels of economic transport. The latter in its turn de- 
pended on the nature of the roads, and especially on the presence of waterways, which facili- 
tated transportation. The sizes of the earliest cities and the distances between them were largely 
determined by relative ease of transportation. 

Expressions are derived for the average size of the earliest cities and for the average dis- 
tance between them. The calculated average populations of the earliest cities are of the order 
of 103; the distance of the order of 10? km. Both are in agreement with some archaeological 
findings. 

An expression for the time span required for the development from the earliest stages of 
urban cultures to the present time is derived and shown to depend on the specific shoreline of 
the country, that is, the length of the shoreline divided by the area of the country. It is pointed 
out that western Europe’s specific shoreline, including land bordering both seas and rivers, is 
ten times as large as the shoreline area of other parts of the world. It is shown that this greater 
specific shoreline may account quantitatively for the faster social and technological develop- 
ment of western Europe in the last few centuries. The calculated total span of time of develop- 
ment from earliest urban cultures to our days is found to be of the order of magnitude of ten 
thousand years. 

It is shown that the model accounts for the existence at the present time of primitive cul- 
tures. A number of suggestions is made in regard to other possible applications of mathe- 


matics to history. 


I. Introduction: statement of problem. Mathematics and history: this is 
probably the strongest antithesis in the field of human knowledge. One a 
purely rational, deductive science, the objects of which are abstract con- 
cepts, creations of the human mind; the other a purely descriptive 
discipline, sometimes dealing with almost appallingly concrete realities, 
such as the exact date of birth or death of a king or potentate. Even if we 
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compare history to the more concrete natural sciences—physics, chem- 
istry, or biology—the antithesis still remains. It has been argued that 
natural sciences and history are at opposite poles methodologically. The 
“natural sciences versus history” question has been discussed and a 
peculiar “historical” way of thinking has been advocated (Spengler, 
1922, 1923). All this may be so, if we accept a particular definition of his- 
tory. In fact, the statements about the “historical” approach as some- 
thing sui generis constitute an implied definition. 

If, however, we look at history as a description of the development of 
different human societies, and consider that history stands very much in 
the same relation to human society as paleontology to the whole king- 
dom of the living, then a naturalistic approach to history not only be- 
comes justified, but is strongly indicated. 

Some historians will probably agree that whatever happens in history 
is the manifestation of interactions of human beings either with each other 
or with the surrounding nature. But man is a biological entity and there- 
fore the problems of human interactions naturally fall within the domain 
of biology. Sociology and history then become, in a broad sense, branches 
of biology. Mathematics has successfully invaded biological sciences and 
has made notable progress in social sciences. The next natural step would 
be its application to history. 

But even granting that the statement above is correct, the historian 
may wonder about the use of mathematics, a quantitative science par 
excellence, in the purely qualitative field of history where hardly any 
quantitative statements are ever made. However, is the difference so 
striking? Many a high school student may have had difficulty with his- 
torical chronology. What is the latter but a set of figures which fix guan- 
titatively the time intervals between historical events? Thus numbers occur 
in history almost at its very roots. It may be argued that intervals of time 
between the births and deaths of some individuals are not likely to be an 
inspiring object for mathematical theorizing, and this may be so to a large 
extent. However, if we come to intervals of time between the “birth” and 
“death” of whole civilizations, which according to Arnold Toynbee (1935, 
1939, 1947) are the fundamental objects of historical study, then purely 
chronological relations are quite apt to. reveal important quantitative 
regularities, as we shall see later on. Furthermore a number of qualita- 
tive statements are found in history, which in principle at least could be 
brought into a much more definite quantitative form. For example, we 
find statements in historical treatises which run somewhat like this: “In 
such and such a civilization at such and such a period, a small class of 
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priests ruled the country. These priests had great wealth. There was also 
a relatively small class of wealthy merchants. The rest of the population 
were serfs or slaves, and lived in miserable poverty.” How much more 
complete and satisfactory to the historian himself it would be to say, 
even with a very rough and crude approximation, how large the respective 
classes were; how much wealth each possessed, and what constituted the 
dire poverty in terms of food consumption. 

Any historian will correctly remark that the desideratum above is 
much easier made than fulfilled, simply for lack of proper information. 
Here and there, however, numerical data appear in history. It is difficult 
to say whether this meagerness is due solely to the difficulty of obtaining 
data or whether, to a large extent, it may be due to lack of proper effort 
to obtain them, the latter being the result of lack of interest. But, meager 
as they are, some quantitative statements already exist about history. 
Since they are more difficult to obtain, the effort may hardly be worth 
while to the historian, if the only result of his effort is some data which 
is collected as such per se. If, however, it could be shown how these data 
can be used in order to obtain, by means of mathematical analysis, a bet- 
ter understanding of different historical phenomena and their causes, the 
historian would have an incentive to exert the greater effort needed to 
obtain such data. Not so long ago geologists and paleontologists were 
able to indicate only the sequence of various geological periods, without 
ever hoping to know their actual durations. Now a “geological calendar”’ 
is rather well established (Croneis and Krumbein, 1936). The future may 
hold some pleasant surprises for the historian also. 

At this point a remark may be inserted about the alleged purely quan- 
titative nature of mathematics. There are essentially qualitative branches 
of mathematics, such as topology, analysis situs, and Boolian algebra. The 
latter had been applied in the mathematical biology of the brain. Whether 
these branches of mathematics will be of any special use to the historian 
is quite conjectural. They are mentioned here, however, to put the quan- 
titative nature of mathematics into proper perspective. 

Opposed as the natural scientist and the historian seem to be in some 
respects, there is one thing in which they find a common ground. Both of 
them are interested not only in knowing what happens (or has happened), 
but also in understanding why it happens (or has happened). In natural 
sciences, observation and experiment seek the answer to the question 
“What happens?”; while mathematical theory seeks to answer the ques- 
tion “Why?” If “understanding” is the ultimate goal of both the natural 
scientist and the historian, then should not the latter be at least willing 
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to give a trial to a method which has brought such a success to natural 
sciences? 

How does mathematical theory achieve its purpose of understanding 
nature, of finding the unseen causes of observed phenomena? It does so 
by studying abstract quantitative conditioned propositions of the form 
“Tf so... then.”? When the “then” part of such a proposition is found to 
agree with the experiment, the scientist justly concludes that the pis 
part may hold. The quantitative nature of this proposition is essential. A 
qualitative statement that under such conditions such an event must take 
place cannot always be unambiguously verified experimentally. Failure 
to observe the expected event may be due to the fact that the event is too 
small or too weak to be detected with instruments available at the time. 
This ambiguity does not exist for quantitative statements. 

Verification of the ‘If . . . then” statement in numerous cases lends it 
a degree of reliability which enables us to conclude with certainty about 
the “then,” even in cases where the latter cannot be directly observed. 
This is essentially the principle on which are based determinations by 
calculations of distances to the sun or moon, the sizes of the sun, moon, 
and the planets and their masses. None of these quantities can ever be 
measured directly; yet their calculated values are certain, if there is any- 
thing certain in science. 

In undertaking the study of any group of phenomena the natural sci- 
entist abstracts from that group only the most essential features. So 
does the modern historian. Toynbee (1935, 1939, 1947) puts at the basis 
of his Study of History not individual nations but what he calls civiliza- 
tions, of which he identifies some twenty odd. Even the concept of a 
nation is to some degree an abstraction, and the concept of a civilization 
is an abstraction on a higher level. This abstraction enables Toynbee to 
find some regularities and uniformities in the historic process. His ideas 
of growth and senescence of civilizations may seem to be somewhat an- 
thropopsychic, patterned too much after the corresponding processes of 
an individual. But the general patterns of historic development which he 
establishes definitely suggest the possibilities of mathematical formula- 
tion, as we shall point out in the last section of this paper. 

We shall, however, attempt a mathematical approach to history from 
a different point. For our purposes we shall make an abstraction which 
goes beyond that made by Toynbee. 

Many of the civilizations studied by Toynbee, while differing from each 
other in many features, still have some very essential features in common. 
Different as the Egyptian, the Babylonian, the Greco-Roman, the Indian, 
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and the Chinese civilizations were in ancient times, they all had the fol- 
lowing properties in common. Their social structure was characterized by 
a hierarchy with a class of priests and/or of kings at the top. The masses 
were enslaved to different degrees. When we compare the descriptions 
of the social structure of all the civilizations mentioned above, as given, 
e.g., by Ralph Turner (1941), we cannot help but notice the general 
similarity. In a previous study (Rashevsky, 1951a, 1952a) we have repre- 
sented such structures mathematically by a so-called hierarchy matrix. 
This type of social structure is characterized by the notion of an inherent 
inequality of men. Even the Greek “democracy” promulgated the equal- 
ity only for a select class. 

The second characteristic common to all ancient civilizations is that 
the greatest bulk of what they considered to be knowledge was based on 
beliefs and not on ascertainment of facts in a modern scientific manner. 
The basic beliefs were of a religious nature, and religion also offered the 
ancient man the “explanation” of the universe. Characteristic of this 
type of “knowledge’’ was a monopoly by the priestly class of some 
aspects of it, as well as a general prohibition of certain “doubts” and of 
any rational inquiry into some questions of faith, a prohibition which 
reached its culminating point in medieval Christianity. Acceptance on 
faith and prohibition of doubt and rational reasoning are to different 
degrees common to all ancient religions, as has been so forcibly described 
by Homer Smith (1952). True enough, the Greek philosophy showed a 
remarkable deviation toward freedom of thought. Still the Greek philo- 
sophical systems were not based on experimental evidence. Modern 
scientists will debate the pros and cons of physical theories by discussing 
which of the theories better represent the known facts. A choice between 
Plato’s philosophy and that of his disciple Aristoteles could be made 
essentially only on the basis of personal predilection to a particular type 
of speculation. Euclid and Archimedes were very rare phenomena indeed! 

This characteristic of all ancient civilizations should perhaps have been 
mentioned before the social characteristics of the ancient civilizations. 
For the idea of the inequality of men, shared by Plato and Aristoteles, 
was essentially a prejudice which persisted because it was accepted on faith 
without critical analysis. Thus the peculiar social structure was closely 
related to the general tendency to accept everything on faith. 

Against this background of prejudice and actual ignorance, some: SCl- 
entific and technological progress was being made in all of the civiliza- 
tions. The rates of this progress were fairly equal in different civilizations. 
Arts frequently developed to a high degree in all of them. 
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So things continued until about the sixteenth century, when Western 
civilization, the offspring of Hellenic, began to show signs of radically new 
developments. Modern scientific methods began to emerge, coupled with 
a more and more pronounced skepticism toward the accepted religious 
teachings. Toward the nineteenth century this cluminated in an ascend- 
ancy of rational thinking over acceptance on faith. The rate of scientific 
and technological developments increased a hundred fold compared to 
a couple of centuries previously (Rashevsky, 1952a). At the same time 
deep social changes have taken place. The old superstition of inherent in- 
equalities of men is being replaced (not without difficulties) by scientific 
studies of mental abilities of different races, which relegate this supersti- 
tion to the rank of a religious myth. 

All these developments resulted in an acquisition by Western civiliza- 
tion of a command over nature hitherto not dreamt of, which, in its turn, 
resulted in a betterment of general economic conditions. Nothing of the 
sort happened in India, China, or the Near East, however, until quite 
recently, when we began.to witness an “‘awakening”’ of these countries. 
Only a couple of decades ago those countries had the old hierarchic social 
structure with its prejudices of inequality. Scientific and technological 
progress did not exceed that of five hundred years ago. Yet the helio- 
centric system was propounded by Aryabhata in India in the fifth century 
A.D. (Dunbar, 1943, pp. 80, 87). 

Finally, we also now have a number of so-called ‘“‘primitive”’ cultures, 
which have survived for ages and never developed the art of writing. 

Thus we notice three distinct groups of mankind: one forming the primi- 
tive civilizations, one comprising the Eastern and Near Eastern civiliza- 
tions, and one the Western. The last two did not differ in their modes of 
development very drastically for millenia, but within the last few cen- 
turies the Western civilization achieved an unprecedented degree of sci- 
entific knowledge, resulting in a mastery over nature and accompanied 
by radical social changes. 

The question naturally arises: why this difference in the development 
of the three groups? The simplest answer, which for some time has been 
taken for granted, is that the peoples comprising the Western civilization 
are mentally superior to those of the East, and the latter, in turn, are 
superior to those of primitive societies. 

A methodological disadvantage of this view is that it really leaves 
nothing to explain. A much more serious disadvantage is that it is con- 
trary to all experimental evidence (Klineberg, 1947). 

If the differences in the rates of cultural developments are not due to 
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innate differences, then their causes must be thought to be in the environ- 
ment. This point of view is now accepted explicitly or implicitly. But no 
one has yet suggested a modus operandi of the environment upon the 
rates of cultural development. The mechanism of the effect of environ- 
ment is still only to be guessed at. 

It is the purpose of this paper to show that certain features of the en- 
vironment can guantitatively account for the large differences in the rates 
of cultural developments discussed above. As with every scientific theory, 
we do not suggest here the explanation of the phenomena in question. The 
explanation is a fiction. We offer, however, @ possible explanation, which, 
in principle at least, is verifiable by procuring in the future appropriate 
data both about the past and the present. 

“Ts that all mathematics can do?” may be asked by the skeptical his- 
torian. “How about numerous aspects in the development of individual 
civilizations? How about the almost innumerable facts and details of his- 
tory? Can you explain those?” The answer to these questions is: “We do 
not know.” No theory in any science was ever able to explain everything 
at once. Explain one fact, then try to explain another; then the next one. 
But unless we begin by trying to explain specific individual facts, we can 
have no hope at explanation at all. We indicate here a way of mathemati- 
cally approaching some problems of history. Whither this way leads, only 
future research can show. 

There is also another point to be considered. In the physical sciences, 
experiment and theory developed more or less hand-in-hand. Galileo was 
soon followed by Newton. The theoretician had less facts to explain. In 
the days of Newton, had the physicists have possessed all the knowledge 
of electricity and of atomic phenomena, the theoretical contributions of 
Newton, Huygens, Fresnel, and others might have looked insignificant 
indeed. In the social sciences, and especially in history, the collection of 
facts had already progressed, even though theory has been lacking. 

II. Geographic factors and distribution of populations. Our problem is 
to find an explanation of how environment can affect the rate of change 
from an “arational”’ behavior, characterized by acceptance on faith, toa 
“rational” one, characterized by critical analysis of experimental evi- 
dence. In particular we seek a possible reason for an accelerated develop- 
ment of the Western civilization as compared with others. 

What environmental factors are characteristic of western Europe as 
opposed to other parts of the world? If we consider climate, we see that 
+t does not answer the specification. While there are large differences 1n 
climate between different parts of the world, almost equal differences are 
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occasionally found locally. What is more important is that the climate of 
China differs much more from that of India than from that of western 
Europe. Yet both China and India were equally retarded in technological 
and social development. Considering fertility of the soil, we find that 
local variations are frequently larger than the variations of averages from 
one part of the world to another. In ancient times the valleys of the Nile, 
Tigris and Euphrates, Ganges, and Huang-Ho were all equally character- 
ized by their fertility. Western Europe also possesses some very fertile 
land. 

If we consider the population densities as an environmental factor, we 
find high population densities both in India (80/km.—*) and China 
(40/km.—*), and a tremendous population density of over 400 in Egypt; 
but also in western Europe (80 for France, 140 for Germany, and 270 for 
England). Though on the average Europe is now more densely populated 
than Asia, there is no reason to assume that this was the case at the dawn 
of civilization: most likely the situation was reversed. Thus we do not 
have a specific difference here either. Sparsely populated Russia has had 
until recently a rate of development closer to that of the more densely 
populated Far East. 

A glance at the world map shows one striking feature, however, which 
makes western Europe different from all other continental parts of the 
world: it has by far the greatest specific sea shoreline, that is, the length 
of the sea shoreline divided by the area of the country. In no other part 
of the world has the sea made so many inroads with its many gulfs and 
straits. The specific sea shoreline of Europe is 4.1 X 10-%, while that of 
Asia and Africa, which have smooth, rounded-up contours, is 1.7 & 10-3 
and 10~* correspondingly (Woytinsky, 1925, p. 2). But the figure 4.1 
refers to all of Europe. If we subtract from the total sea shoreline of about 
39,000 km. (Woytinsky, Joc. cit., p. 2) some 5,000 km. of Russian sea shore- 
line, and divide the result by the area of western Europe [total Europe less 
Russia and Poland (Statistisches Reichsamt, 1937, p. 7*)], we find a 
specific sea shoreline of about 6 X 10-%. 

The total river shoreline for western Europe is also considerably larger 
than for other parts of the world. Using data from Rand McNally’s 
Cosmopolitan World Atlas (Rand McNally, 1949, pp. 116 ff.) we find a 
specific shoreline for western Europe to be about 9 X 10-*, whereas for 
China it is only 1 X 10-* and for India 5 X 10-4. Thus the total specific 
shoreline of western Europe (sea and rivers) is almost 10 times as large as 
that for China and India. 


If we now turn our attention to the Polvnesian Islands, which are the 
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seats of many primitive cultures, we notice that while they have a high 
specific sea shoreline, they do not form a single continent-like area, but 
constitute an assembly of isolated small units. 

We shall now proceed to demonstrate that the differences in specific 
shorelines may quantitatively account for the observed differences in the 
rates of cultural developments. 

In previous papers (Rashevsky, 1951a, b) we have studied the bio- 
sociology of the change of the behavior of society from an arational to a 
rational one. A factor in the mechanism of this change is the ability to 
communicate a new idea, or merely a dissidence from an old idea or old 
prejudice, to others. This ability to communicate depends on the struc- 
ture of what may be called the “‘social communication net.’’ The mode of 
distribution of population over a territory does affect the structure of the 
social communication net. Information spreads differently through a 
uniform thinly spread population and one which is concentrated in a few 
densely populated areas. 

As we now shall see, the specific shoreline does sufficiently strongly 
affect the distribution of the population, and hence the structure of the 
social communication net, to account for the observed differences in the 
rates of cultural development. 

The modern methods of mass communication have altered the picture 
considerably. However, those methods became available only within the 
last century, which constituted only about 13% of the total span of de- 
velopment of humanity since the beginning of earliest urban cultures ca. 
5,000 s.c. (Turner, 1941). Since the differences in the rates of development 
of the West and East began to be noticeable well before the advent of 
these methods of mass communication, we can legitimately leave them 
outside of our considerations so far as our immediate problem 1s con- 
cerned. We shall seek the roots of the differences of the rates of develop- 
ment in the earliest stages of urban cultures. a 

Any urban culture, even the earliest, implies a division of labor be- 
tween city and country. It also necessarily implies a nonuniform distribu- 
tion of population. City population must be supplied with food from out- 
lying agricultural areas. The rate of supply possible depends on the avail- 
able means of transportation, which, in their turn, depended (in antiquity) 
essentially on the character of the terrain and especially on the presence 
of waterways. Easy transportation may permit supplying a very large 
city from rather far distances. With limited meaus of transportation it 
may be impossible to bring food necessary for an adequate supply into a 
large city from the vast outlying rural area. This may necessitate break- 
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ing up a large city into smaller ones so that each can be supplied from a 
smaller area by the use of available transportation facilities. 

We shall now derive an approximate mathematical expression which 
will show how the average size of cities depends on the means of transpor- 
tation and how the latter depends quantitatively on the presence of 
waterways and the size of the shoreline. 

Consider the following crude model of early urbanization. 

Let p, be the average productivity per unit time of urban goods per 
urban individual, expressed in some appropriate measure. 

Let p, denote a similar thing for rural, agricultural goods. 

Let c, be the average per capita consumption of urban goods and ¢, 
the average per capita consumption of rural goods. 

A number J; of individuals will be engaged in the business of trans- 
porting goods to and from the rural areas. We shall arbitrarily count these 
N, individuals as part of the urban population. The number of individuals 
engaged in actual production of urban goods shall be denoted by N,. If 
NV, denotes the total urban population, NV, the total rural, and NV the total 
population of the country, then 


Nu= Not Ni; Not Nit N-=N. (1) 


Under the relatively primitive conditions which we are now considering, 
any transportation by land is done by means of camels, horses, or other 
burden-carrying animals. Let the total number of animals used for that 
purpose be L, the average load per animal be »’, and the average velocity 
of locomotion be v. Each animal requires attendance by some of the NV; 
men during the process of transportation. Let g be the number of animals 
which can be attended by one man. In general g = 1. Then 


L=qN:. (2) 
We shall now introduce the average load » per attendant. We have 
v=q’. (3) 


Let there be m cities altogether spread approximately uniformly 
through an area S. The average country area a per city (neglecting the 
area of the cities themselves, which is always justified) is 


a=. (4) 
m 


The average distance 7 over which the agricultural products must be 
moved to the city in order to supply it with food is roughly of the order 
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of magnitude of 
.= vs (5) 


if we exclude very unusually shaped areas or strictly linear arrangements 
of cities in a narrow strip of land. 

The total number of burden-carrying animals per city is L/m. 

If, on the average, the animal has to travel a distance 27 back and forth 
on each trip, then per unit time there will be v/27 such trips. On each trip 
the animal brings an amount »’ of supplies. Hence the total amount of 
rural supplies brought to a city, per unit time, is 


w’L 


27m’ (6) 
which, because of (2) and (3), may be written as 
vv N; (7) 


27m~ 
This must be equal to the total consumption V,c,/m in each city. Equat- 
ing (7) to N.c,/m, and substituting for 7 the expression (5), we find 
Foa72 
=. (8) 
al ie 
The average population 7 of a city is equal to N/m, and, therefore, 
because of (8), 

= yu Ni (9) 

"= FASN,,” 


The total production of urban goods Vz, is equal to the total con- 
sumption Nc,. Hence 


C. 
=N—. (10) 
Np ry 
Similarly we have 
Ne= N=. (11) 


Introducing (10) and (11) into the second equation (1) and solving for V; 
we have 


Cu Cr _ Pups — CuP> — Crbu v7. G2) 
| : tl Ce rs Pub 
Putting 
eee athe bons Whe anagem i (13) 
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remembering (1), introducing (10), (11), and (12) into (9), and putting 


N 
a wi (14) 
we obtain 
pais ilar 2 15 
"Fe (put p) agi iby 


The greater the population density D, the greater the average population 
n of a city. 

In considering the rural productivity p, as constant, independent of the 
population density, we tacitly assume that the population density is 
not too high, so that there is enough land per person to obtain the maxi- 
mum production of which an individual is capable. This condition may 
have been rather widely present. In a subsequent paper we shall specify 
the conditions under which the assumption is valid and investigate the 
actual range of its validity. A more general approach is needed in those 
cases where the assumption is not valid, that is, where the density of 
population is excessive. 

The following remark may be made. The smallest annual yield of 
wheat per hectare known at present in the underdeveloped countries is 
280 kg.; the world’s average is about 900 kg. (Statistisches Reichsamt, 
loc. cit., pp. 42*-45*). Similar figures hold for other cereals. Even the 
lowest figure is sufficient to supply one person for a year. Thus even with 
the lowest figure and a population density of about 100 per km.?, each 
individual would have enough land. True enough, land is not exclusively 
used to grow cereal, and not all land is equally useful. But then the figure 
of 280 kg. per hectare is probably-much too low for the ancient world. 
If we take the present world average, it will increase the admissible 
population density to about 300 per km.* Then, all corrections taken into 
account, it is safe to use the assumption of a constant #,, as long as the 
population density is less than 100. 

If, with Flinders Petrie (1923, p. 80), we assume that the annual yields 
of crops in ancient Egypt where of the same order of magnitude as in 
present day England, that is, about 2,000 kg. per hectare, then allowable 
population densities will increase even more. 

For our purposes, however, the dependence of % on 2 is of greater inter- 
est. The average velocity vis affected by a number of factors. In a rugged 
mountain country or in a jungle, v will be much smaller than in the open 
plain. But the factor which affects v most is the river or sea shoreline. 
Water transport in antiquity and until recently was always much more 
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rapid than land transport. Moreover », the load per attendant, is also 
larger for water transport. The nature of the animals used also affects vv 
and therefore . Thus if elephants are used, who carry—say—ten times 
as much as horses, and if the number of attendants per animal is the same, 
then v is increased tenfold. 

The following rough calculation gives an idea of the effect of water 
transport on vv. Consider a transport by horses and carts, with a load per 
horse of 1 ton, average speed of 4 km./hour, and one attendant per horse. 
This gives a value vv = 4 ton km. hr.—. Consider a sailing ship, carrying 
100 tons, manned by 5 individuals, and averaging a speed of 15 km. 
hr.—. The value of vv is now 300 ton km. hr, that is, 75 times greater. 

However, we cannot apply equation (15) directly either to a country 
using both land and water transport or to one using water transport pre- 
ponderantly. This is because, in the latter case, the geometry of the coun- 
try does not correspond to the assumption implied in equation (5). Egypt 
is such a case. The Nile valley is about 900 km. long and, on the average, 
50 km. wide (Breasted, 1908; Wilson, 1951). 

Consider an urban culture in which cities are located along a river or a 
sea shoreline of length / and width A. If m, is the number of cities, then 
the average distance between them is 

ae (16) 

Ms 
The average transport distance 7,, which corresponds to the distance 7 in 
expression (5), now is 
(17) 


The total amount of rural goods transported to a city is again given by 
an expression similar to (7). We shall denote by (vz), the value of vv for 
water transport. Then, instead of (7), we have 
(vv) Me (18) 
27.M, 

This again must be equal to the total consumption N,c,/m, of a city 
per unit time. Therefore, using (1), (10), (11), (12), (13), and (17) we 
find an expression which corresponds to (8): 

— (put p) Cr (19) 
3 p(vv)s 


Hence the average population 7, of a city is equal to 
= Nu Np(v). DpA(vt)s (20) 
n = = 


—— 
— 7 


Ms Gol Cr 
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Denote by A the ratio of the average value of (vv), for water transport 
and the average value of (vv) for land transport. These values, in general, 
differ for different types of goods transported. But inasmuch as we are 
dealing here with crude approximations and are primarily interested in 
orders of magnitude, the consideration of averages is justified. Thus we 


have 
(2) AG (2A) 


Hence 
as as (22) 


In general only some of a country’s cities will be located along the main 
rivers or along the sea shore, and their population will be approximately 
determined by (22). Other cities will be located sufficiently far from rivers 
or sea; they will be supplied principally by land transport, and their 
average population will be given approximately by (15). Actually there 
will be some cities which will receive part of their supplies by land, part by 
water. Asa first approximation we shall disregard these intermediate cities 
though a further refinement of the theory outlined here would have to take 
them into consideration. We shall also assume that the density D of 
population is the same along the shorelines and inland, though, in gen- 
eral, this is definitely not true. 

The total number of cities is equal to m + m,. Therefore the average 
population 7, of a city in a country with mixed transport is 

Nu 
m+ m,: 


n,= (23) 


Denote by & the specific shoreline, that is, the length of the shoreline per 
unit area. We have, for any country, 


T=éS. (24) 


Since, as we have assumed, water transport conditions prevail along the 
length / of the shoreline in a strip of wide A, the total area in which land 
transport conditions prevail is equal to: 


S—lIA=S(1—Aé). (25) 
Expression (8) for m may be transformed by introducing into it ex- 
pressions (10), (12), and (1). Putting 
p2 


Amr)! oe 
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and remembering (14), we find 


(27) 


Now replacing S'in (27) by expression (25), introducing (24) into (19), 
and then introducing (27) and (19) into (23), we find, again using (1), (10), 
and (12), as well as (21): 
be A pAD (vv)? 
~ 4pA— [4pAA— Ac, (v0) ] E° 

For € = 0, expression (28) reduces to (15), as should be the case. When 
the total length of the shoreline / becomes so great that JA = SS, or, be- 
cause of (24), AE = 1, then we have a situation in which water transport 
conditions alone exist. Hence the upper limit physically possible for £, in 
a land with mixed transport conditions, is given by 

1 


oS ae (29) 


Ne (28) 


For & = 1/A expression (28) reduces to (22), as should be the case. 


If 
Ac, (vv) 


30 


A< 
then the coefficient of & in (28) is negative and 7, decreases with increas- 
ing &. Physically this is due to the fact that a very small A implies a very 
large 7, in order to secure a productive area per city sufficient to provide 
for the city’s population. If 7, becomes too large it may upset the ad- 
vantage of a larger (vv),. The larger A, the smaller may be A. Equations 
(5), (26), and (27) give 


phat WS be 31 
OT Doped) © ioe 
while equations (17), (19), and (21) give 
_ pA») 32) 
Sree : 


The expression for the average transport distance per city in a land with 
mixed transport conditions may be found in the following way. Because 
of (25), expression (27) for the total number m of inland cities becomes 


A (put p)S(1 —AE) 23 
ie am Tr’ Ty 2) Years ( ) 


From (19), (21), and (25) we find the number of cities on the shore: 


_ (put p) 6rES (34) 


Ms pA (v2) 
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Dividing the total area S by the sum of (33) and (34) gives us the average 
area per city. The square root of this [cf. expression (5)] gives us the order 
of magnitude of the average transport distance 7. We thus find: 


i ap A pA (vv)? 35 
n= Vos eh Lois i3e2 


For & = 0 this reduces to (31), as should be the case. 

Let us consider some numerical examples. If we consider cereals as the 
principal consumable rural product, the order of magnitude of c, may be 
taken as 1 kg. day—!. The quantity p, + p is essentially the ratio V../N 
of the total urban population to the total population. Though there are no 
data on this quantity for very deep antiquity, it seems likely that in the 
days of the earliest urban cultures (some 5000 B.c.) the ratio V,/N was 
of the order of about 10-*, so that only a small percentage of the popula- 
tion lived in the cities. The ratio p/(p. + p) is essentially the fraction of 
urban population which is engaged in transport and trade with the coun- 
try. Let us assume, as an illustration only, that that fraction is of the 
order of 10-!. As we have seen above, A may be of the order of 10°. For 
A we shall take 10 km., that is, about 5 km. on both sides of a river. Any 
larger value of A would imply a considerable amount of land transport. 
The velocity v is not to be taken as the average velocity of the burden- 
carrying animal, but smaller. For the individual who makes the trip from 
the town to the country and back spends time not only on actual travel 
but also on loading and unloading at various points over a fairly large 
area where he travels. This reduces the effective speed v. Considering the 
poor road conditions of antiquity, we shall assume that the effective 
average speed was about 5 km. day~!, with a load of 100 kg. per at- 
tendant. This means that it took a day to pick up a load of 100 kg. and 
move and deliver it a distance of 5 km., which does not seem too im- 
plausible. We shall take v ~ 5 km. day—!, »v ~ 10? kg., which is a fair 
value for large burden-carrying animals. Thus we take 


¢-~1 km. day ; Put p~10; p~2X10*; 


(36) 
A=10 km. ; (vv) =5 X 10? kg. km. day ; A~10?. 
With these values we find from (26) 
Ar10-*. (37) 
Introducing the values (36) and (37) into (28) and (35) we find 
= _ 25D 
‘1-955 Cs. 
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and 
bees 20s.» 
- V1—9.5¢- (3?) 


As we have seen above (p. 204), £ varies in the range from about 10-3 
to 10-*. That would result in a variation of 7, from 25D to 27.5D, a varia- 
tion of about 10%. 

Hardly anything is known about the density D of population at the 
dawn of history. At the beginning of our era the population of Egypt was 
estimated to be about 7,000 (Langer, 1948, p. 21). It has more than 
doubled since then, with a population density now of over 400 per km.2 
(Statistisches Reichsamt, 1937, p. 9*). It would be very unlikely that the 
population remained essentially constant between 3000 B.c. and our era. 
If anything, it should have increased more rapidly than it increased in the 
last 2,000 years, because the present high population density would cor- 
respond to the upper part of the sigmoid growth curve. If we assume that 
at the beginning of the dynasties the population was about a million, we 
find D ~ 30. This gives 7, ~ 10°. This value is not implausible and is 
probably of the correct order of magnitude. John A. Wilson (1951, p. 34) 
states that ‘‘. . . one may doubt whether any community in earliest Egypt 
deserved the term ‘city.’ They all were agricultural villages of greater or 
lesser degree.” Petrie (Joc. cit., p. 21) mentions that the town of Kahun 
in the 12th dynasty (about 2000-1800 B.c.) had about 360 houses. The 
same seems to hold of the earliest Mesopotamian cities. Jarmo which was 
an agricultural village at about 4700 B.c. had an estimated population 
of 200 (Braidwood, 1950, 1952), and occupied an area of about 12,000m.?, 
thus giving an average of 60m.? per person. Eridu at about 4000 B.c. had 
an area of 25,000 (Lloyd and Safer, 1947). If we assume the same density 
of population, this corresponds to about 4,000 inhabitants. Later cities 
were apparently considerably larger. Without giving sources, Gordon 
Childe (1952, p. 94) states that in the 3rd millennium B.c. the populations 
of Lagash, Umma, and Khafajah were reliably estimated at 19,000, 
16,000 and 12,000 respectively. These figures do not contradict the 
former, for we certainly would not expect that during two millennia the 
cities of the ancient world would not have appreciably increased. On the 
other hand, the meagerness of earliest archeological data does not permit 
us to draw too definite a conclusion as to the size of the earliest cities, and 
the figures quoted above may have to be revised. This will simply mean 

-a different choice of the parameters (36). Taking p ~ 10~ will make 
n ~ 104 
Equation (39) gives us for the order of magnitude of 7, the value of 
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about 50 km. It is interesting to note that the distances between the early 
nome capitals in the Nile delta, as well as between the early cities of Meso- 
potamia, were about 20 miles, that is, 32 km. (Petrie, 1923, p. 4). The 
order of magnitude is again correct. All this does not prove anything, how- 
ever, because of the arbitrary choice of the parameters (36). The agree- 
ment may perhaps indicate the approximate correctness of our choice. 
The desirable thing to do is to derive the values of the parameters from 
general theoretical considerations. This will be done in a subsequent 
paper in which we shall study the mechanism of formation of early cities. 
The only value of the above considerations is that they serve as illustra- 
tions of what can be done in principle with a mathematical approach to 
these problems. 

Actually, cities in any civilization are not all of the same size. There 
are few large cities, more small ones. The theory of the distribution of city 
sizes offers interesting problems (Rashevsky, 1948) which are still largely 
unsolved. Pending their solution we may assume some plausible distribu- 
tion function of city sizes. If, for example, we assume that the sizes of the 
cities vary according to pure chance (an assumption which is certainly not 
precisely true but which may serve as a fair approximation), then the 
fraction of cities with a population between and ” + dv is given by 

f (n) dn==e "dn, (40) 


e 


where 7, is the average size of the city, which we have calculated above 
[eq. (28)]. Introducing expression (28) into (40), we find the expression 
of the distribution function in terms of &, the specific shoreline. 

IIT, Mechanism and rate of evolution of human society. At a very early 
age, considerably before he acquires any ability for critical reasoning, a 
child is taught a number of things, and perforce accepts these uncritically 
on faith. Some of the things a child so learns may pertain to rules of 
conduct, and are thus of the nature of conditioned reactions. Other 
things may pertain to the existence of God or other deities, devils, angels, 
goblins, or other mythological entities. These things are also fundamental- 
ly of the nature of conditioned reactions. Objectively we can learn whether 
an individual believes in God, witches, or poltergeists only by either notic- 
ing his statement to that effect, which is a verbal neuromuscular reaction, 
or by observing some other type of overt behavior such as church at- 
tendance, performance of rites of exorcism of the devil, etc. 

Those early conditioned reactions grow in strength with time, due to re- 
inforcement by repetition. They all form a type of behavior which we 
call arational (Rashevsky, 1951a). 
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At a later age the human being gradually develops the ability for 
critical reasoning and a tendency to analyze assertions before accepting 
them on faith. The intensity of this tendency also increases with time at a 
rate which varies from individual to individual. If the rate of increase of 
the tendency for rational behavior is greater than the rate of increase in 
strength of the previously conditioned reactions, then at some age the 
individual will abandon at least some of his early acquired beliefs and 
prejudices, and will change from arational to rational behavior. If some 
of the beliefs have been taught very early and, as a result of this, have 
been very strongly conditioned, then the developing ability for critical 
thinking may never be able to overcome them. For example, it is not in- 
frequent that scientists, who would take nothing for granted in their 
fields, devoutly believe in various church doctrines; or, like the Nobel 
prize winners, J. Stark and P. Lenard, accept the Nazi prejudice of racial 
superiority. 

The intensities of the conditioned reactions and of the tendency for 
critical analysis reflect the intensities of excitations of certain regions or 
“centers” of the brain, which mediate those reactions and tendencies 
(Rashevsky, 1948). Let e; denote the “‘central’’ intensity of the condi- 
tioned reaction and e, the intensity of the tendency for critical reasoning. 
sie o=E—«- (41) 

If ¢ = 0, then the tendency to accept on faith and the tendency to 
analyze everything critically are equal: the individual has no preference 
one way or another. Under those conditions he will behave rationally and 
arationally with equal frequency (Rashevsky, 1951a). If @ > 0, the tend- 
ency to arational behavior is greater. The greater 4, the greater will be the 
relative frequency of arational behavior. For example, the individual may 
behave rationally in 20% of all situations which he faces, arationally in 
80%. For sufficiently large positive values of ¢ the frequency of arational 
behavior of the individual becomes practically 100%. 

When ¢ <0, the relative frequency of rational behavior is greater, 
and it increases to 100% as @ decreases toward — ©. H. D. Landahl 
(1938) has developed equations which give the relative frequencies of 
both types of behavior for a given value of ¢ (Rashevsky, 1948). Le 

As a first approximation we may assume that both the conditioned 
arational reactions and the tendency for critical thinking increase linearly 
with age; one beginning from zero at age 70, the other beginning with zero 
at age r* > ro. Then introducing two constants 0 and A, we have 


ep= b(t—70) fort >, ebay tor b> ty 2 (42) 
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If b and ) vary from individual to individual according to a normal 
distribution curve, then the average value ¢ of ¢ for a sufficiently large 
population, which increases exponentially, can be shown (Rashevsky, 


1951b) to be 


$2 em} eve, (43) 


vy nd 
where ¥ is a biological constant closely related to the birth and death rates 
in the society. If 8 denotes the net relative rate of the increase of popula- 
tion and if e~ individuals survive one year after birth, then 


y=B+n- (44) 


If é > 0, more individuals will exhibit arational than rational behavior, 
because there are more individuals with a greater tendency for arational 
behavior. Moreover, the effect is enhanced by imitation phenomena or 
phenomena of social pressure. Those phenomena cause some individuals 
to choose a particular behavior because ‘‘Most of the others do it,” not 
out of personal preference. Taking this into consideration, we find that the 
fraction yo of individuals who behave rationally is given approximately 
by the expression (Rashevsky, 1951b): 


n= [vara A), as 


where A and B are constant (the meaning of which is explained in the 
paper immediately referred to above) and G(x) is defined by 


x’? 


G(a) — (Qn) -2 [exp ax’. (46) 
0 


For sufficiently large values of ¢, the fraction yo is less than 3, and yp > 0 
as@Go+t om, ‘ 

In early human society ¢ must have been sufficiently positive and yo 
very small: most of the individuals in the society behaved arationally. 

The yo individuals who do not follow the generally accepted behavior 
may conveniently be called “nonconformists.” If the nonconformism 
consists of a non-adoption of prejudices and in rational behavior, the yo 
individuals may be tentatively identified with Toynbee’s “creative 
minority” (Toynbee, 1935-39, 1947). 

Because, as seen from equation (45), yo depends on ¢ and the latter, 
through equation (43), depends on 79, the age at which particular beliefs 
and prejudices begin to be conditioned, therefore yp depends on 79. With 
increasing To, Yo also increases (Rashevsky, 1951b). Thus yo is both a 
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function of the biological constants of the individuals who compose the 
society and of the social factors which determine the age at which different 


beliefs begin to be taught. Therefore yo varies for different beliefs, 

The evolution of the behavior of the society from an almost totally 
arational to a more or less rational one means a gradual increase of the 
fraction yo of nonconformists. What can cause such an increase? 

Two possibilities have been suggested. The first is to assume that all 
the nonconformists raise their progeny without teaching them the ac- 
cepted prejudices. The existence of the fraction vp of nonconformists is 
due to the natural variations of } and \ from individual to individual. A 
fraction yo of all the newborn individuals will be, so to speak, spontane- 
ously nonconformist, and this fraction remains constant from generation 
to generation if all newborn individuals are subject to the same early 
conditioning. If, however, the progeny of all the nonconformists is raised 
as nonconformists, the situation is different. Denote by N the total popu- 
lation, by WV the number of nonconformists. Let the relative birth rate be 
uu, so that wN individuals are born per unit time altogether, and uN indi- 
viduals are born per unit time from the V nonconformist parents. The 
uN individuals born from nonconformist parents are all nonconformists. 
Of the remaining »(N — WV) newborn individuals, a fraction yo spon- 
taneously becomes nonconformist. If v denotes the relative death rate 


and if sera (47) 
then 
EN = (u-) N=BN, (48) 
and . 
cali uN+you(N — N) —N; 
or 44 
EN = 8N+you (NW). (49) 
Let us now put 4 
eee (50) 
vies 0 


The quantity y denotes the fraction of all nonconformists, including the 
“natural” or spontaneous ones and those that are raised as nonconformists 
by their parents. After simple transformations equations (49) and (50) 
give: dy 


pee 1— 5, 
di Yor ( y) 


(51) 
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If we consider that at a very early stage of human civilization y was 
negligibly small (this implies a very small yo), then equation (51) inte- 
grated with the initial condition y = 0 for ¢ = 0 gives 


y=1— emt, (52) 


Thus y gradually increases to unity. Actually it is not necessary for y to 
become close to unity in order that society exhibit rational behavior 
preponderantly. Due to the above-mentioned effects of imitation, a 
sudden ‘‘swing”’ toward the general acceptance of rational behavior will 
occur when y exceeds a threshold y*. From some semi-quantitative his- 
torical data, the value of y* has been estimated at approximately 0.1 
(Rashevsky, 1952a). 

The birth rate is varying between 10° year — and 10-* year —. If we 
assume wp ~ 10-? year —' and yo ~ 10-%, then it will take about ten thou- 
sand years to increase y from zero to 0.1. 

The mechanism of increase of y carries some implications, which may 
not be too plausible, though they are by no means impossible. 

The assumption of a very small initial y, that is, of a very small yo, 
leads to the following necessity. Let us say there is one nonconformist 
per hundred population. In order to raise his children as nonconformists 
he not only has to have a nonconformist mate, but he must isolate the 
children from the conformist influence of his neighbors who outweigh 
him numerically. This is not realistic. If yo is very small, say 10- or less, 
then in a society with small density of population and uniform spatial 
distribution, the probability of a nonconformist marrying another non- 
formist is very small. Even if this happens, their children are exposed to 
the influence of the average members of the society and raising them as 
nonconformists becomes practically impossible. In order to satisfy the 
assumptions implied in the derivation of equation (52), two conditions 
are necessary. First, the population density must be large enough to 
bring a sufficient number of nonconformists within a sufficient spatial 
proximity to each other, even for small yo. Second, the absolute number of 
nonconformists must be sufficiently large to form a more or less closed 
group in which the children may be raised with a minimum of interference 
from the rest of society. For example, a colony of 100 families may 
possibly isolate their children from outside influences. A group of two 
families can hardly do it. 

While the first condition may not be satisfied in very small rural com- 
munities, it is more likely to be satisfied in cities. The second condition, 
however, requires not only that the population density in the city be 
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large enough, but also that the city be large enough to contain the 
necessary minimum of nonconformists. Denote that number by g. Then if 
we consider, as a first approximation, all cities as having the same average 
size n, we must have 

yoneSg or mS, (53) 

Yo 

If condition (53) is satisfied, then the process described by equation (52) 
will take place in the cities. If V, is the total urban population, then of the 
total of yoV nonconformists born in the society, only yoV,,/N will be able 
to raise their progeny as nonconformists. This will have the same effect 
as if we substitute in equations (49) and (52) for yo the value 


: 
Yaa = UT (54) 

But if (53) is not satisfied, then no increase of the fraction of noncon- 
formists beyond the “natural” fraction yo will occur and society will not 
evolve. 

If we go back to equation (28), we see that 7, increases with the popula- 
tion density D. Initially D may be small and 7, may be too small to satisfy 
(53). If the population increases exponentially, then for a constant area, 
denoting by D, the initial population density, we have: 


D=D <é'. (55) 
Introducing (55) into (28), putting m. = g/yo, and solving for ¢, we find 
for the moment ¢* at which an evolution of society will begin: 


sae g{4pA— [4pAA— A cy (vv) ] E} 
eels ag A pA yo Do 00)? oa 


For sufficiently large values of A [cf. discussion following inequality 
(30)] ¢* decreases with £. Hence a larger shoreline will reduce ¢*, and cause 
an evolution of society to begin earlier. 

As we have said, however, the assumption of cities of uniform size 
is not realistic. If we use as a better approximation the distribution func- 
tion (40), then we see that even if the first condition is satisfied in all of 
them the second will be satisfied only in cities with a population greater 
than: g/¥o. The total population living in cities of the size g/¥o or larger 
is given by 

N= = 2s" ne dn= Nu (EB +1)e 1 Yorn (57) 
g/vo : 


Ne 


Of these individuals there are yo. born nonconformists who can raise 
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their progeny as nonconformists. Hence 
a eta Cao ae 2 

An increasing specific shoreline increases , and therefore increases 4,.. 

The other possible mechanism for an increase in number of noncon- 
formists seems to be more plausible. 

If there were no nonconformists at all, then every new generation of 
individuals would be taught the same old beliefs and prejudices. However, 
even if a very small number of nonconformists is present, then as long as 
they make their nonconformist behavior known in some way, they are 
bound to at least slightly affect the conditioning (by decreasing its in- 
tensity) of the new generations to be old beliefs. 

Putting 

eb eee ae (59) 
o 2: 


we may write expression (43) thus: 
= bb—X- (60) 


In complete absence of nonconformists the new generation will be taught 
the accepted beliefs with the same intensity as was the old generation. 
The value of bo will remain the same from generation to generation. On the 
other hand, if y = 1, that is, if all of the population would behave rational- 
ly, the new generation would not be taught any of the beliefs at all, and 
therefore b) would be zero. Hence for any value 0 < y < 1 the value of Bo 
for the newborn individuals will be 


bo [1—u(y)], (61) 
where u(y) is a function of y such that 
u(0O) =0; #(1) =1.. (62) 


The exact form of u(y) will largely depend on how well the whole 
population knows about the existence and behavior of the nonconformists. 
In a thinly and uniformly spread population, in which y is very small, a 
number of individuals will never have heard about the nonconformists. 
The effect of the nonconformists upon the young generation will be 
slight. In areas of heavy concentration of population the effect may be 
much stronger. 

The problem is closely related to the problem of rumor spread (Rapo- 
port and Rebhun, 1952), which is an offspring of the general theory of 
random nets developed by Anatol Rapoport and his associates in connec- 
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tion with their studies of the function of the central nervous system 
(Rapoport, 1948a, b, c; 1950a, b, c, d; 1951). 

The principal difference between the theory of rumor spread hitherto 
developed and the present problem of spread of information about the 
nonconformists is that in the former the rumor is considered as “planted” 
only once, whereas any nonconformist, if he lets it be known about him- 
self at all, is likely to keep on repeating his dissident views. He thus be- 
comes a continuous source of information rather than an instantaneous 
source, as studied in the theory of rumor spread. This, however, is not an 
essential difference, since the theory of continuous rumor “planting”’ is 
the next natural generalization of the present theory. In fact, one par- 
ticular case has already been studied by Landahl (1953). 

In his study of random nets with distance bias, Rapoport (1951) dis- 
cusses a situation which is likely to be especially applicable to our prob- 
lem. He considers an information net, which consists of individual groups, 
within which the information is transmitted very fully, so that all mem- 
bers of the group are completely informed about each other. The informa- 
tion between the groups, which are spatially separated, is some decreasing 
function of the distance. This model is very closely realized by a number 
of small communities which are scattered throughout the country. 

Landahl (1953) has studied a case in which for a community contained 
in the circumscribed area, the fraction of “knowers,” or informed indi- 
viduals, is proportional to the number of originators of information; in our 
case to y, to the density of population, and to the total population. If the 
conditions for which Landahl derives his equations are realized, then in 
those cases the effects of nonconformists upon the younger generations 
will be much stronger in larger cities than in small ones. Thus the distribu- 
tion of population between cities of different sizes should have a direct 
effect upon the influence of nonconformists and, therefore, upon the rate 
of evolution of the society. 

Thus the theory of history becomes closely related to the theory of 
random nets, itself an outgrowth of the mathematical biology of the brain. 

Pending further developments of the theory of random nets we shall 
now derive an equation for the rate of evolution of society, at first leaving 
the function u(y) unspecified beyond the conditions (62). 

If at a given moment f the society is characterized by a given y and a 
given ¢, then, according to our assumption, the individuals of the new 
generation, born between ¢ and ¢ + dz, will have a value ¢’ of ¢ given by 


$! = bo [1—u(y)] —do=G— bo (y) (63) 
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During the time dé a number uN of individuals are born with the 
value $’. Consider the “total”? value V¢. The contribution of the newly 
born individuals to the total value is uV¢’. During the same interval dt 
a number vJV of individuals die, removing vV¢ of the “total.” Hence 


oA Nel Sean (64) 
Because of (63) this gives us 
Since 
cv = BN, (66) 
therefore (65) becomes 
a 
co = — bu (y) . (67) 


We now must find a relation between ¢ and y. That relation is given by 
equation (45) in which we now put y for yo. The expression, however, is 
practically unmanageable. Expression (45) represents an approximate 
solution, which holds only for very small values of yo. The exact equation 
[Rashevsky, 1951b, eq. (30)] can be solved graphically. By thus con- 
structing y graphically as a function of ¢, we see (Fig. 1) that with suf- 
ficient accuracy we may represent y by the relation 


ke 


y=y*e- (68) 


where y* is the value of y for 6 = 0 and k is a properly chosen constant. 
In the absence of imitation y* = } (Rashevsky, 1951a). In the case of 
imitation we have 0 < y* < 4, and y* may be of the order of 10-2 (Ra- 
shevsky, 1952a). 

Expression (68) gives 
Pd 


=a 
Introducing this into (67) we obtain 
d 
= hu boyu (9) « (70) 


If we approximate u(y) by y, which is the simplest function that satisfies 
conditions (62), and integrate equation (70) with the initial condition 
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y = yo for ¢ = 0, we find: 
Yo 


ee eis Ba (71) 
For ¢ = 1/ukbovo, y = &. Since y cannot exceed 1 physically, therefore 
equation (71) holds only in the interval between f = 0 and 
1 — yo 
t= (ie 
in k bY ’ ( ) 


at which time y = 1. As we have said above (p. 218), a sudden transition 
to a state in which y is very close to 1 occurs, if imitation factors are 
present, when approximately y ~ 0.1, that is, for 


(73) 


= pk boo 


0.1 


5 —¢ 10 


i i d i i lution of 

Ficure 1. The full line represents yo as a function of ¢ obtained by a graphical so 
the exact equation from ath the approximation (45) is derived. The following values of the 
parameters were used: A/B = 0.12, a/B = 0.1. The broken line represents the curve yo = 


0.26¢e-*- 4, 


As remarked above, the form of u(y) is, in general, determined by the 
number of cities, by their sizes, and by the methods and efficiency of 
communication within and between them. If the population » of a city 
is less than 1/0, then on the average no nonconformists will be born in 
that city. Hence only communities whose sizes are greater than 1/¥o will 
produce born nonconformists. Using the distribution function (40) again, 
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we notice that the fraction of the urban population which lives in all such 
communities is 

a= (14) one (74) 

YoNe 

We shall now consider a hypothesis, which may not be too plausible but 
which leads to a possible theory of the effect of city size upon the rate of 
cultural development. Alternative hypotheses will be suggested later, and 
will be developed in subsequent papers. 

We shall assume that the information which the inhabitants of sparsely 
populated rural areas receive about each other directly is negligible. The 
bulk of information which they receive about each other comes via the 
city which is supplied by the given rural area. This assumption, while un- 
doubtedly not exact, is not altogether lacking in plausibility. Country peo- 
ple go to town for news of all sorts. With this assumption, rural areas be- 
longing to cities whose sizes are less than 1/9 and who do not contain 
any nonconformists will not know anything about them. Areas supplying 
larger cities will be informed about nonconformists. And here we shall 
make a less plausible simplifying hypothesis, namely, that in those in- 
formed areas the information is fairly complete, that is, the fraction of 
the informed individuals is of the order of unity. It may vary, for ex- 
ample, between 0.3 and 1. Since the rural areas are roughly proportional 
to the sizes of the cities which they supply, therefore ¢ measures approxi- 
mately the fraction of the total population which is informed about the 
nonconformists. 

If w = 0, there is no evolution of the society. On the other hand, when 
«@ = 1, we may have the situation represented by expression (71), which 
is derived on the basis of (63), in which we put u(y) = y. Hence u(y) 
should, in general, also be a function of the parameter w, u(y, w), such 
that for y < 1 we have u(y, 0) = u(0, w) = 0, u(1, w) = 1, and u(y, 1) 
= ¥, 

From the third requirement it follows that w(1, 0) = 1. This, together 
with the first requirement, shows that w(y, w) is not analytic at the point 
y = 1, w = 0, and therefore it cannot be expanded into a power series 
which will converge for all values 0 < y < 1;0 < w <1. It is, however, 
analytic at the point y = 0, w = 0, and therefore can be expanded into a 
power series for sufficiently small values of y and w. The first two require- 
ments show that the expansion is of the form 


u(y, @) = pd dinw'y* (75) 
ie ; 
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with doo = din = Ao = O. If the lowest term of the expansion is ay, then 
the requirement u(y, 1) = y requires only that a;, = 1, where 7 is the 
lowest power of w in the expansion. The expansion thus may begin with a 
term a@;,w*y. From physical considerations it is seen, however, that j = 1, 
because, as we said above, in the first approximation w measures the frac- 
tion of the total population which is informed about the nonconformists, 
and the effect of the nonconformists on dg, that is, u(y, w), is in the first 
approximation proportional to that fraction. 

Since we are interested in values of y which are = 0.1, we may use the 
first term of the expansion only, and put 


u(y,w) =o). (76) 


As an example of a possible exact expression for u(y, w) we may take 


wy 

Ty (16) a 
Introducting (76) into (70) we obtain after integration, with y = yo for 

t= 0, 

0 

7 = ere . (78) 
The distribution function (40) implies that there are cities of all sizes. 
In communities which are much smaller than 1/7, no nonconformist will 
ever be born, or, if one is occasionally born, it will be but once in several 
generations. A society which consists of only such communities will not 
develop at all, or will develop very slowly. Such conditions prevail among 
the primitive cultures. Even now the population of the largest of them, 
the Samoan, is about 40,000, and this is scattered over a number of small 
islands (Mead, 1949). Others number not over 2,000, and some even only 
a few hundred. The largest villages reported mow have about 500 in- 
habitants. Since the ancient cities of developing civilizations had about 
103 inhabitants, we may estimate yo at about 10-*. With the average 
population of the ancient cities also being of the order of 10°, we see that 
w, as given by (74), was of the order of 2 From equation (28) we have 
seen that a change in the specific shoreline by a factor of 10 changes ” by 
approximately 10%, and this will change by about 5% according to 
(74). The rate of evolution, as given by (78), will change in the same pro- 
portion. Thus a tenfold increase of the specific shoreline of western 
Europe, as compared to that of Asia, could account for the observed 
difference of about 5% in the rate of development of western Europe on 
the one hand and India and China on the other. We say “observed 5%,”’ 
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because India and China now seem to be beginning to catch up with the 
developments in western Europe of about 300 years ago. The difference 
is about 5% of the total span of approximately 7,000 years during which 
urban civilization existed. 

It must also be remembered that 7 and therefore w increases with time. 
However since we see that at the earliest stages w already was of the order 
of unity, and since its upper bound is one, therefore, in our equations w 
may be considered as a slowly varying parameter whose time derivative 
is negligible. 

The actual difference in the rate of development between western 
Europe and the East may be somewhat less than might be expected. There 
are several factors which tend to equalize these rates. Very likely the most 
important is the cultural exchange which existed between East and West 
even in ancient times. To treat western Europe as an entirely separate 
unit is, of course, unrealistic. It may be justified as a first approximation 
in the same way as the omission of the attraction of the earth by other 
planets is justified in the approximate treatment of the earth’s orbit. Of 
interaction and interdiffusion of cultures we shall say a few words below. 

Moreover, the effect of a greater specific shoreline in western Europe 
may have been partly offset by a possibly smaller population density D in 
the earliest stages of history. 

It must be emphasized that this possible offsetting effect of the popula- 
tion density D is a very weak point of the present model. In Section IV 
below we shall mention other models of the effect of the specific shoreline 
which are free from this shortcoming. In a subsequent paper, however, 
we shall show that a further elaboration of the model discussed here leads 
to an elimination of the difficulty. It may be noticed that while 7, is pro- 
portional to D [eq. (28)], 7, does not depend on D [eq. (35)]. The quantity 
7, is an important parameter in the interurban communication network, 
and under certain conditions the interurban communications may play a 
major role as compared with the urban-country communications. 

The circumstance that when y reaches approximately the value 0.1 a 
quasi-sudden change in behavior occurs tends to mar the proper perspec- 
tive in the estimation of the rates of development. From the very be- 
ginning and up to the critical time ¢* the difference in the rates of develop- 
ment shows very little. The general character of a civilization in which 
y = 0.01 is not much different from the character of one in which 
y = 0.05, In both cases the majority, which accepts arational behavior, is 
overwhelming. After the time ¢* a very rapid increase of y takes place and 
the original nonconformist behavior becomes socially accepted as stand- 
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ard. Therefore, for values of ¢ that are even slightly greater than ¢*, the 
difference between two civilizations will appear to be very large, though 
one may be only a relatively “‘short”’ time ahead of the other (cf. Rashev- 
sky,1952a). 

Now let us discuss the other parameters in equation (78). The birth 
rate u is likely to be of the order of 10-2 to 10-%. As for ds, it may be 
roughly estimated in the following manner. In the earliest stages of urban 
cultures, when completely arational behavior characterized society, it 
may be surmised that b) > X in expression (60), so that the initial 
value ¢; of ¢ was approximately ¢ ~ by at the beginning. But at ¢ = 0, 
y = yo and, therefore, using equation (68) 

Viet ets (79) 
or 


kbp = —log ge = log y*—log yp. (80) 


Let us first consider the case in which imitation is absent. Then y* = 3, 
With yo = 10-? we find 

k by — 6 . (8 1 ) 
The time at which society changes to preponderantly rational behavior is 
given by an equation similar to (73), namely, 


Parla Je 


*¥= F (82) 
LR bywo 
With the above values for w, yo, kbo, and yw, we find 
t* ~~ 104 years . (83) 


Imitation is likely to reduce y* to as little as 10-* (Rashevsky, 1952a). 


This will give, from (80), 
k by~ 2 25 ) 


which increases ¢* but does not change its order of magnitude. 

This agreement of the estimated order of magnitude of the evolution 
time from earliest urban civilizations to our days may certainly be con- 
sidered as interesting. 

According to the theory outlined, it is the largest cities that take, so to 
speak, the lead in cultural developments. Communities with 2 < 1/yo do 
not contribute directly to social evolution. This, however, holds only on 
the average, and cities whose population is in the neighborhood of 1/70 
may occasionally show a much more rapid rate of development toward 
abandonment of arational behavior. In communities for which 2 > 1/0, 
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the original number you of nonconformists is sufficiently large and rela- 
tively constant. If, however, x is only slightly greater than 1/+o, then you 
will be small and subject to high statistical fluctuations. The average rela- 
tive fluctuation is given by 


3 (em) = Jom (84) 


If yon = 1, the average relative fluctuation will be 100%. For yon = 2 
it will be 1/2 ~ 70%. For yon = 25, it will be only 20%. Thus it is high- 
ly probable that in some of the cities whose size is just about equal to 
1/yo there will be not one, but two, nonconformists. This will have the 
effect of doubling the initial yp and will result in a pronounced excess of 
for the city in the next generation. Such a city may develop much more 
rapidly than the bigger ones and may adopt a rational behavior consider- 
ably earlier than other cities. Since such a small community is out- 
weighed numerically by the others, any attempt at adopting a new be- 
havior is likely to be abortive until the time ¢* [given by (82)]. Neverthe- 
less the very first manifestations of mass discontent with the accepted 
behavior patterns are more likely to come from such small communities. 
The peasant revolts of the fourteenth century, as well as such “local” 
movements like Albiginses, Wycliffites, or even the Hussites, may be il- 
lustrations of the phenomenon discussed. It must also be kept in mind 
that w is not a constant, but increases with time as the population shifts 
to large communities. A number of social and economic factors are in- 
volved in this process. Mathematical approaches to it have been outlined 
elsewhere (Rashevsky, 1948a, 1952b). 

IV. Other possible effects of the shoreline. The efiect of the specific shore- 
line upon the rate of cultural development, as discussed in the preceding 
section, is a rather indirect one. By affecting the transport facilities, it 
affects the distribution of population, and the latter, by affecting the 
structure of the social communication net, affects the rate of cultural 
development. 

Other effects of the shoreline are, however, also conceivable. They shall 
be made the object of a separate study, and shall only be briefly men- 
tioned here. This does not mean, however, that we attach less sig- 
nificance to them. Different possibilities should be first studied ix abstracto 
without prejudice to one another. Only later can such a study lead to a 
decision as to which possibility, or any combination of them, is the more 
important. 


Widely separated places were much more easily reached by sea than by 
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land in olden times. A society which lived along the shore of a sea may 
have had much easier access to other societies living on the same shores. 
Such access, which facilitated commercial intercourse, also facilitated 
cultural intercourse. The latter, however, directly affects the quantities bo 
and ) in expression (60). The more an individual sees and knows the peo- 
ple different than his own and of behaviors different than those accepted 
by his society, the less strongly will he be conditioned to this behavior. 
Thus bo will be decreased. At the same time, extensive travel and knowl- 
edge of different countries and behaviors is likely to stimulate the reason- 
ing process of an individual, and thus increase \. The net effect will be 
that of reducing ¢ and increasing vo, thus increasing the rate of develop- 
ment. 

The Mediterranean basin, where Western civilization arose, is unique 
from the point of view of offering easy access by sea to a large number of 
different societies settled along its shores. An inhabitant of a tiny Pacific 
island may have a greater specific shoreline available, but he does not 
have within his reach such a diversity of societies and cultures as had the 
early inhabitant of the Mediterranean shores. 

Another possible, purely biological, factor may be affected by western 
Europe’s large specific sea shoreline. The sea provides us with iodine, 
which is essential for a normal functioning of the thyroid gland. In parts 
of the world like western Europe where the sea makes so many inroads, 
the content of iodine in air is bound to be larger than in places like the 
inland of China or India, which are much farther removed from the sea. 
It is estimated (De Castro, 1951) that at present about 25% of the 
population suffers from a sufficiently acute thyroid deficiency to produce 
goiter and cretinism. Even a milder deficiency is likely to affect such a 
purely biological function as reproduction. A glance at equations (78) 
and (82) shows that the fate of social development is directly proportional 
to the relative birth rate wu. 

Available figures for population growth in Europe and Asia for the last 
four centuries (Wright, 1942, Vol. I, p. 599) show a much larger relative 
increase in the population of Europe than of Asia. Whether this is due to a 
larger birth rate in Europe or to a smaller death rate cannot be said. Even 
if the birth rate were larger in the East, but the infant mortality were to 
outweigh it, the effect on cultural development would be the same as a 
decrease of u. In the derivation of equation (64) we tacitly assume that u 
does not vary too strongly with age. If on the uN newborn individuals the 
larger fraction does in infancy (before the age To at which they begin to be 
taught anything), the result is the same as a decrease of in equation (64). 
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Considering the important role that thyroid gland plays in reproduc- 
tion, it is not implausible that the relative remoteness of the sea in eastern 
Asia and India may have had a retarding effect on cultural development 
through an effect on the birth rate yw. A statistical analysis of available 
data on the iodine content of the air and of its correlation to manifesta- 
tions of thyroid deficiency may show whether the effect is strong enough 
to account quantitatively for the relative retardation of the rate of social 
evolution in the East. 

It is interesting that both effects discussed in this section affect the rate 
of social evolution in the same direction as the effect treated in more 
detail in the preceding section. 

V. Further possibilities. In conclusion we shall point out some other 
problems of history to which mathematical methods may be applied. 

Going back to “prehistoric”’ times, we find two important phenomena 
which may be better understood through mathematical analysis. One con- 
cerns the diffusion of mankind from its still hypothetic place or places of 
origin. This problem is only a special case of the more general problem of 
spread of any biological species. J. G. Skellam (1951) has shown that 
such a spread follows a diffusion equation of the form 


= DV-o+F (9) , (85) 


in which ¢ is the population density of the species, and F, in general, is a 
nonlinear function. The F(¢) term expresses the fact that the population 
which has migrated continues to reproduce at a rate which is a function 
of its size. 


The rather irregular shape of the boundary of the regions in which such 
a process of diffusion takes place makes any attempt at an exact solution 
of (85) in some cases hopeless. A recent generalization of Landahl’s 
(1953b) approximation method could be used to advantage in this prob- 
lem. It is true that present available archaeological data can hardly yield 
enough material to check any mathematica! conclusions. However, even 
if some approximate data of the first appearance of man at various geo- 
graphical points can be found, this may enable us to evaluate the order of 
magnitude of the diffusion coefficient D. This, in its turn, may give us 
some information as to the migrating habits of prehistoric man. The 
method would be applicable also to historic times when various “Volker- 
wanderungen”’ took place. These migrations were apparently from regions 
of higher population densities to regions of lower densities. They notice- 
ably became rarer and rarer, as should be expected, since those migrations 
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tend to level out differences in population densities. The exact equation 
which describes such phenomena is not likely to be a simple linear dif- 
fusion equation, quite apart from the production term. There seems to be 
a threshold in the “concentration gradient,”’ which must be exceeded be- 
fore a “diffusion” sets in. Mathematically, the problem offers some very 
interesting aspects. To the historian its solution may help to evaluate the 
magnitudes of past human migrations. 

Another “prehistoric” problem which deals with the diffusion equation 
is the spread of some cultural trait. To illustrate what can be done in this 
field, let us consider the following case as an illustration. 

It is estimated that certain prehistoric cultural practices (use of wheat) 
originated in the Middle East and only appeared in Europe about 3,000 
years later (Turner, 1941, Vol. I, p. 56). It might be supposed that this 
was produced not by an actual spread of population but by the spread of 
information. Landahl (1953b) has shown that such a phenomenon also 
may be described by a diffusion equation. Since we deal here only with 
crude approximations, we shall omit in our equations all numerical co- 
efficients of the order of unity. The diffusion coefficient D is then given by 


D~ fv, (86) 


where ¢ is the mean free path and v the average velocity of drift of a single 
individual. Let o denote the effective distance of constant between two 
individuals. If two individuals come as close together or closer than a, 
they will interact in some way and may exchange information. If 6 is 
the density of population, then 


(~—. (87) 


The distance s traveled by a diffusion front during the time / is 


sw vVDt. (88) 
In our case s ~ 10% km., ¢ ~ 10° years. Hence from equation (88): 
Dron 103 km.” year™. (89) 
Equations (86) and (84) give 
pee (90) 
oD 


If we estimate that for the sparse populations which prevailed in pre- 
historic times o was of the order of magnitude of the distance at which 
two individuals will see each other, then we may put ¢ ~ 1 km. As for 2, 
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it is not likely to be greater than 10? km. year. With those values and 
with the value for D given by (90), we find 
or 10, kn (91) 


In other words, there was one individual per 10 km.?, a density of popula- 
tion about 300 times smaller than the average density of population of the 
earth now. The figure is not implausible. However, this case is used, as we 
said, for illustration only. Our calculations do not prove anything. But the 
example illustrates what could be done if we had an estimate of o and 2. 
That such estimates may somehow be provided is not precluded. 

Now let us pass to historic times proper. In this paper we discussed the 
gradual change from a behavior characterized by acceptance on faith to 
one characterized by rational analysis. We have seen that the earlier an 
individual is conditioned to some belief, the more difficult it is for him to 
abandon it. In every society the earliest belief taught to a child is the gen- 
eral belief in some deity and other supernatural ideas. Various prejudices 
and beliefs of a social nature, for example, class superiority or racial 
prejudices are usually conditioned at a later age. For those prejudices ¢ is 
less positive, according to equation (43), and therefore yo is greater, ac- 
cording to (45) or to (68). Hence, according to (73), it will take less time 
for society to abandon those social prejudices than the earlier conditioned 
religious ones. 

The establishment of some accepted behavior pattern by a social 
group has, at least in the past (and probably mostly now also), been of 
the nature of a dictum to be accepted without too much critical analysis. 
Those patterns may be of a social nature or of some cultural kind. The 
duration of the acceptance of such a behavior pattern will again be deter- 
mined by equation (73). Since yo is a function of To, the average age at 
which the conditioning begins, we may from known durations calculate 
7) and thus gain an insight into the methods of bringing up children during 
different historical periods. This certainly would add to our knowledge of 
the past. 

In our previous studies (Rashevsky, 1948a, 1951a) we have investi- 
gated the problem of duration of behavior patterns characterized by the 
control of behavior by a small “ruling” group. Durations of different 
dynasties may, for example, fall within the scope of that study. This 
duration is a function of the rate of population growth, of certain other 
parameters which may be estimated, and of the coefficient of social mobil- 
ity. This coefficient roughly indicates what percentage of individuals, born 
in the nonruling class, eventually climb up the social ladder and obtain 
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prominence as members of the ruling class. The higher the social mobility, 
the longer the duration of the ruling period. The possibility exists that 
from estimated values of other parameters and from the durations of dif- 
ferent dynasties, it may be possible to evaluate the coefficient of social 
mobility, which again will give us important knowledge of the behavior of 
the past. 

The author is under no illusion as to any ready acceptance by historians 
of this method of calculating past events of which we have no direct in- 
formation. Yet such things as the distance from the earth to the sun or the 
mass of the sun can be found only by calculations. Nevertheless they form 
one of the most certain parts of our knowledge. Mathematics may come 
to serve history by “postdicting”’ past events, just as it “predicts” future 
events in physical sciences. 

In his comparative study of civilizations, Arnold Toynbee (1935-39; 
1947) has pointed out the general pattern of development of civilizations, 
which begins with a “time of troubles,” followed by a “universal state”’ 
and “universal church,’ which, in its turn, through a process of disinte- 
gration, leads to a new time of trouble. Very roughly, we may identify 
Toynbee’s “times of troubles’ with periods of non-integrated behavior, 
while the “universal state” is characterized by a rather integrated be- 
havior, where certain behavior patterns are generally accepted. If this is 
so, then the changes studied by Toynbee may perhaps be illustrations of 
the changes from one accepted behavior to another, which we have 
studied mathematically (Rashevsky, 1951a). The transition periods may 
perhaps be sometimes identified with the “time of trouble.” Such transi- 
tion periods as we have studied are of rather short duration. It is, however, 
possible under certain conditions to obtain much longer periods of non- 
integrated behavior. 

The author is indebted to Drs. Anatol Rapoport and George Karre- 
man for a critical reading and discussion of this manuscript. 
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